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Preface 


This Gentle Introduction is a radical revision, as yet very partial, of my earlier 
‘Notes on Basic Category Theory’ (available here). 

The gadgets of basic category theory fit together rather beautifully in multiple 
ways. Their intricate interconnections mean, however, that there isn’t a single 
best route into the theory. Different lecture courses, different books, can quite 
reasonably take topics in very different orders, all illuminating in their different 
ways. In the earlier Notes, I roughly followed the order of somewhat over half 
of the Cambridge Part III course in category theory, as given in 2014 by Rory 
Lucyshyn- Wright (see also Julia Goedecke’s notes from 2013). We now proceed 
rather differently. The old ordering has its rationale; but I think that the new 
ordering has a greater logical appeal. Hence the rewrite. 

The topics we will eventually cover, in different arrangements, are also the 
topics of (for example) all but the last chapter of Awodey’s good but uneven 
Category Theory and of the whole of Tom Leinster’s terrific - and appropriately 
titled - Basic Category Theory. But then, if there are some rightly admired 
texts out there, not to mention various sets of notes on category theory available 
online (see here), why produce another introduction to category theory? 

I didn’t intend to! My goal all along has been to get to understand what light 
category theory throws on logic, set theory, and the foundations of mathematics. 
But I realized that I needed to get a lot more securely on top of basic category 
theory if I was eventually to pursue these more philosophical issues. So my earlier 
Notes began life as detailed jottings for myself, to help really fix ideas: and then 
- as can happen - the writing has taken on its own momentum. 

What remains distinctive about this Gentle Introduction, for good or ill, is that 
it is written by someone who doesn’t pretend to be a fully-formed expert who 
usually operates at the frontiers of research in category theory. I hope, however, 
that this makes me rather more attuned to the likely needs of (at least some) 
beginners. That’s why I go rather slowly over ideas that once gave me pause, 
and I have generally tried to be as clear as possible. Despite the length, however, 
the coverage in this present version still falls somewhat short even of the books 
by Awodey and Leinster, let alone the full Part III syllabus. However, I hope 
what is here will prove useful to others starting to get to grips with category 
theory. My own experience certainly suggests that initially taking things at a 
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Preface 


gentle pace as you work into a familiarity with categorial ways of thinking makes 
later adventures exploring beyond the basics so very much more manageable. 



1 Introduction 


This is in lieu of more expansive introductory chapters. I really ought one day 
to provide a lot more preliminary scene-setting. In particular, I eventually want 
to write at some length about the concept of a mathematical structure, and also 
about the non-set-theorist’s everyday working conception of sets. But that will 
have to wait. In putting together the present version of this Gentle Introduction, 
my first concern is to get the presentation of some basic category theory into 
better shape. So I’m going to be very brisk in this chapter - but I hope forgivably 
so for now. 


1.1 Why categories? 

Here is a fundamental insight: we can think of a family of mathematical struc- 
tures equipped with structure-preserving maps/morphisms/functions between 
them as itself forming a further mathematical structure. 

For example: a particular group is a structure which comprises some objects 
equipped with an operation on them obeying certain familiar axioms. But we 
can also think of a family of such groups, taken together with maps between 
them (the homomorpliisms which preserve group structure) as forming a further 
structure-of-structures. 

We can then investigate such structures-of-structures, and indeed go on to 
think about structure-preserving maps - or as they say, functors - between them. 
Going up another level, we can talk in turn about maps between such functors. 

These layers of increasing abstraction are the topic of category theory. So if 
modern mathematics already abstracts (moving, to take another kind of exam- 
ple, from concrete geometries to abstract metric and topological spaces), cate- 
gory theory abstracts again, and then again. What, if anything, do we gain by 
going up another level or two of abstraction? To quote Tom Leinster, 

Category theory takes a bird’s eye view of mathematics. From high 
in the sky, details become invisible, but we can spot patterns that 
were impossible to detect from ground level. (Leinster, 2014, p. 1) 

Which is already reason enough for mathematicians, or at least those with a 
certain generalizing cast of mind, to be interested in category theory. 
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What about others? Well, category theory can be thought of as a kind of 
generalized theory of maps or functions. It is no surprise then that it should 
turn out to have close links to the lambda calculus, which is another kind of 
generalized theory of functions. So that’s one reason - among a number - why 
category theory turns out to be of considerable interest to logicians and theoretic 
computer scientists (who are already concerned with matters around and about 
the lambda calculus). 

Again, philosophers - or at least those interested in questions about the sense 
in which modern pure mathematics is a study of abstract structures - will want to 
know what light category theory sheds on these questions. Rather more generally, 
the theory should perhaps appeal to a certain philosophical temperament. Many 
philosophers, pressed for a short characterization of their discipline, like to quote 
a famous remark by the profound American philosopher Wilfrid Sellars, 

The aim of philosophy, abstractly formulated, is to understand how 
things in the broadest possible sense of the term hang together in 
the broadest possible sense of the term. (Sellars, 1963, p. 1) 

This conception sees philosophy, not as the self-indulgent pursuit of linguistic 
games or as providing obscurely edifying discourse, but as continuous with other 
serious enquiries, seeking to develop an overview of how their various subject- 
matters are interrelated. Category theory has a claim to provide part of that 
overview, as it explores the logical geography of aspects of abstract mathematics. 

There are solid reasons, then, for theorists pursuing various disciplines to take 
an interest in category theory. However, some advocates have made much grander 
claims for the novelty and fundamental importance of categorial insights. For 
example, some proponents have suggested that category theory should replace 
set theory in the role of, so to speak, the official foundations of mathematics. I 
certainly make no such claims here at the outset. After all, we can only be in a 
position to assess the ambitions of such enthusiasts after we have made a decent 
start on understanding category theory for its own sake. 


1.2 What do you need to bring to the party? 

Legend has it that over the doorway to Plato’s Academy was written ‘Let no 
one ignorant of mathematics enter here’. The doorway to category theory should 
perhaps be similarly inscribed. After all, you will not be well placed to appreciate 
how category theory gives us a story about the way different parts of modern 
abstract mathematics hang together if you don’t already know some modern 
abstract mathematics. 

But in fact, in this Gentle Introduction, we need not presuppose very much 
background. We’ll simply take it that you have some very fragmentary knowledge 
e.g. of what a group is, perhaps of what a topological space is, and some similar 
bits and pieces. And if a few later illustrative examples pass you by, don’t panic. 
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1.3 Theorems as exercises 


I usually try to give multiple examples of important concepts and constructs; so 
simply skip those examples that don’t work for you. 

In due course, I hope to say rather more about the background that here we 
have to take for granted. For example, it would be illuminating and relevant 
to explore just how much set theory is or isn’t presupposed by mathematicians 
working on (say) group theory or topology. After all, a group is said to be 
a set equipped with an operation obeying certain axioms (and conventionally, 
operations or functions are also identified with sets): but just how seriously do 
we need to take this conventional set talk? However, such questions - as I have 
already indicated - are for a later version of this Gentle Introduction. For now, 
we will just have to assume that groups, topological spaces and the rest are 
abstractions in good standing and see where we get. 

1.3 Theorems as exercises 

There are currently no exercises in what follows - or at least, there are none 
explicitly labeled as such. Maybe that’s another omission to be rectified later. 

However, almost all the proofs of theorems in basic category theory are easy, 
and often they are very easy indeed. Even when a result like the Yoneda Lemma 
takes a bit more work, I try to break things down using intermediate lemmas so 
that each stage is as straightforward to prove as possible. 

Few of our theorems, then, require much more than straightforwardly apply- 
ing category-theoretic definitions and/or earlier results in the book (perhaps 
together with tiny amounts of background knowledge). There’s little ingenious 
trickery. So as a reader you can think of almost every theorem as in fact pre- 
senting you with an exercise which you could, even should, attempt in order 
to fix ideas. For a few tougher theorems, I sometimes give hints about how the 
argument goes. The ensuing proof which I spell out is then the answer (or at 
least, an answer) to the exercise. I almost always take these proofs at a gentle 
pace, as befits a Gentle Introduction. 

1.4 Notation and terminology 

I try to keep notation and terminology standard, and where there are standard 
alternatives often mention them. 

But I should mention at the outset one minor idiosyncracy. As well as using 
the familiar □ as an end-of-proof marker, I use <1 as an end-of-defmition marker. 
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2 Categories defined 


We are going to be investigating the general idea of a family of structures with 
structure-preserving maps between them: such a family is the paradigm of a 
category. 

But what can we say about a family-of-structures in the abstract? One general 
thought is this: if, within a family, we have a structure-preserving map from A 
to B , and another structure-preserving map from B to C, then we should be 
able to compose these maps, follow the first by the second, to give a structure- 
preserving map from A to C . What principles should govern such composition 
of maps? Associativity, surely: given maps 

A — > B — > C — > D 

it ought not matter how we carve up the journey from A to D. It ought not 
to matter whether we apply / followed by the composition g-followed-by-/i, or 
alternatively apply the composite /-followed-by-g and then afterwards apply h. 

What else can we say at the same level of stratospheric generality about 
structure-preserving maps? Very little indeed. Except that there presumably 
will always be the limiting case of a ‘do nothing’ identity map, which applied to 
A leaves it untouched. 

That would seem not to give us very much to go on: but in fact it is already 
enough to guide our initial very abstract definition of categories as families-of- 
structures in this chapter. We then present some initial examples, and introduce 
categorial diagrams . 1 

2.1 The very idea of a category 

A category ^ consists in certain data (in the sense of ‘givens’, rather than of 
‘information’ !), with the data governed by a couple of axioms. 

Definition 1 . The data for a category ^ come in two distinct, non-overlapping, 
sorts: 


1 Logicians already have a quite different settled use for ‘categorical’. So when talking about 
categories, I much prefer the adjectival form ‘categorial’, even though it is the minority usage. 
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2.1 The very idea of a category 


(1) Objects (which we will typically notate by l A\ ‘ B\ ‘C\ . . . ). 

(2) Arrows (which we typically notate by ‘/’, ‘g\ ( h\ . . . ). 

Further, 

(3) For each arrow /, there are unique associated objects src(f) and tar(f), 

f 

respectively the source and target of /. We write ‘/: A — > FT or l A — > B ’ 
to notate that / is an arrow with src(f) = A and tar(f) = B. 

(4) For any two arrows /: A — ► B, </: B — >• C, where src(g) = tar(f), there 
exists an arrow g o f: A —> C, l g following /’, which we call the composite 
of / with g. 

(5) Given any object A , there is an arrow 1 A : A — > A called the identity arrow 
on A. 

The axioms then require 

Identity. The identity arrows do behave like identities! So for any /: A — > B 
we have fol A = f=l B of. 

Associativity. For any / : A — ► B, g: B — ► C, h: C -» D, we have ho (g o f) = 
(hog)o f. <1 

Five quick remarks on our terminology and notation: 

(a) The labels ‘objects’ and ‘arrows’ for the two kinds of data are standard. But 
note that the objects of a category (the first type of data) are very often 
not ‘naked’ objects, so to speak, but rather - as promised are typically 
entities that come equipped with certain operations and relations etc., i.e. 
can be thought of as structures. 

(b) It should also be noted, though, that the ‘objects’ in categories needn’t 
always be objects at all in the logician’s strict sense, i.e. in the sense which 
contrasts objects with entities like relations or functions. There are cate- 
gories whose objects - in the sense of the first type of data - are actually 
relations, and other categories whose objects are functions. 

(c) Borrowing familiar functional notation ‘/ : A — > B’ for arrows in categories 
is entirely natural given that arrows in many categories are (structure- 
preserving) functions: indeed, that’s the paradigm case. But as we’ll soon 
see, not all arrows are functions. Which means that not all arrows are 
morphisms either, in the usual sense of that term. Which is why I rather 
prefer the colourless ‘arrow’ to the equally common term ‘morphism’ for 
the second sort of data in a category. (Not that that will stop me often 
talking of morphisms or maps when context makes that natural!) 

(d) In keeping with the functional paradigm, the source and target of an arrow 
are very often called, respectively, the ‘domain’ and ‘codomain’ of the ar- 
row. But again that usage has the potential to mislead when arrows aren’t 
(the right kind of) functions, which is again why I prefer our terminology. 
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(e) Finally, note the order in which we write the components of a compos- 
ite arrow (some from computer science do things the other way about). 
Our standard notational convention is again suggested by the functional 
paradigm. In a category where f : A — ► B, q: B — » C are both functions, 
then go f(x) = g(f{x)). 

Now, our axioms suffice to ensure our first mini-result: 

Theorem 1. Identity arrows on a given object are unique; and the identity arrows 
on distinct objects are distinct. 

Proof. Suppose A has identity arrows 1 A and \' A . Then applying the identity 
axioms, 1 A = 1 A ° 1(4 = 1 ( 4 . 

For the second part, we simply note that A ^ B entails src(l A ) ^ src(ls) 
which entails 1 A ^ 1 b- □ 

(As this illustrates, I will cheerfully call the most trivial of lemmas, the run-of- 
the-mill propositions, the interesting corollaries, and the weightiest of results all 
‘theorems’ without distinction.) 

So every object in a category is associated with one and only one identity 
arrow. And we can in fact pick out such identity arrows by the special way they 
interact with all the other arrows. Hence we could in principle give a variant def- 
inition of categories which initially deals just in terms of arrows. For an account 
of how to do this, see Adamek et al. (2009, pp. 41-43). But I find this technical 
trickery rather unhelpful. (The central idea of category theory is perhaps best 
understood as the idea that we should probe objects by considering the mor- 
phisrns between them; but that surely need not mean writing the objects out of 
the story altogether!) 

2.2 Examples 

Naively, we might think of a structure as comprising some objects (plural) 
equipped with functions and/or relations on them. But a structure nowadays 
is much more commonly thought of as being a set (a single thing) equipped 
with functions and/or relations on it. But is there a difference? If so, does it 
matter? 

As we said in the Introduction, we are having to shelve such questions for 
now. So, going along with the usual modern line on structures, we will expect 
some paradigm examples of categories to have as objects sets-equipped-with- 
sorne- functions/relations, and then the arrows between such objects will then 
be suitable functions between the carrier sets which in a good sense ‘preserve 
structure’. Or perhaps we should say ‘respect structure’, for ‘preservation’ sounds 
like a matter of producing a full copy, which is more than we usually require. 

We start with an extremal case, where the sets in fact come with no additional 
structure: 
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2.2 Examples 


(1) Set is a category with 

Objects: all the sets (just naked sets, equipped with no extra struc- 
ture). 

Arrows: given sets A', Y, any (total) set-function / : X — > Y is an 
arrow. 

There’s an identity function on any set; set-functions / : A — >■ B, g: B — > C 
(where the source of g is the target of /) compose; and the axioms for being 
a category are evidently satisfied. 

Four comments on this initial case, (i) Annoyingly, there is already a substan- 
tive issue about just what category we have in mind here when talking about Set. 
What kind of sets are its objects? Pure sets, or do we allow sets with urelements 
(members which aren’t themselves sets)? Are these sets as constrained by e.g. 
the axioms of ZFC or are we thinking of a universe of sets better described by a 
rival set theory like NFU? How safe is it to talk about all the sets? 

Again, perhaps we should not pause over such questions now, or we’ll never 
get started! Take your favoured conception of the universe of sets (or it you think 
that universe is indeterminately open-ended, consider levels of the set universe 
up to some suitable ‘inaccessible’ rank): then its objects and functions should 
constitute a category. So for the moment, you can just interpret talk of sets 
in your preferred way. For some more introductory remarks, see here. And do 
eventually take a look at Leinster 2014, Ch. 3, ‘Interlude on sets’. But here let’s 
simply press on! 

(ii) Note that arrows in Set, like any arrows, must come with determinate 
targets/ co domains. But the standard way of treating functions set-theoretically 
is simply to identify a function / with its graph /, i.e. with the set of pairs ( x , y) 
such that f(x ) = y. This definition is lop-sided in that it fixes the function’s 
source/domain, the set of first elements in the pairs, but it doesn’t determine 
the function’s target. For a trivial example, consider the zero functions 2 : N — > N 
and z ' : N — > {0} where both send every number to zero: same graphs, different 
targets. 

Perhaps set theorists themselves ought to identity a set-function /: A — > B 
with a triple (A, /, B). But be that as it may, that’s how category theorists can 
officially regard arrows f\A—*B in Set. (Still, having made the point, it will 
be harmless for most purposes to casually talk of set-functions, arrows in Set, as 
if they are just sets of pairs.) 

(iii) We should remind ourselves why there is an identity arrow for 0 in Set. 
Vacuously, for any target set Y , there is exactly one set-function / : 0 — >• Y, i.e. 
the one whose graph is the empty set. Hence in particular there is a function 
10 : 0 — t 0. 

Note that in Set, the empty set is in fact the only set such that there is exactly 
one arrow from it to any other set. This gives us a nice first example of how we 
can characterize a significant object in a category not by its internal constitution, 
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so to speak, but by what arrows it has to and from other objects. We will return 
to this. 

(iv) The function id a- A — > A defined by idA^x ) = x will evidently serve in 
the category Set as the (unique) identity arrow 1 , 4 . 

We can’t say that, however, in pure category-speak. Still, we can do something 
that comes to the same. Glancing ahead to ideas which we’ll again return to, 
note first that we can define singletons in Set by relying on the observation that 
there is exactly one arrow from any object to a singleton. So fix on a singleton, 
call it simply ‘1’. Then consider the possible arrows (i.e. set-functions) x: 1 — > A. 
There is a one-one correspondence between these arrows and the elements x € A. 
So we can think of talk of arrows x: 1 — > A as our category-speak surrogate 
for talking about elements x of A. Then instead of saying idA(x) = x for all 
members x of A , we can say that for any arrow x : 1 — > A, we have 1 a 0 x = x. 
(More on this sort of thing in due course: but it gives us another glimpse ahead 
of how we might trade in talk of sets-and-their-elements for categorial talk of 
sets-and-arrows-between-them. ) 

(2) To continue with our examples of categories, there is also a category FinSet 
of hereditarily finite sets (i.e. sets with at most finite numbers of members, 
these members in turn having at most finite numbers of members, which 
in turn . . . ) and the functions between them. 

(3) Pfn is the category of sets and partial functions. Here, the objects are 
naked sets again, but an arrow / : A — > B is a function not necessarily 
everywhere defined on A (one way to think of such an arrow is as a total 
function / : A' B where A' C A). Given arrows-qua-partial-functions 
/ : A — ► B 1 g: B — ► C, their composition g o f: A — > C is defined in the 
obvious way, though you need to check that this indeed makes composition 
associative. 

(4) Set* is a category (of ‘pointed sets’) with 

objects: all the non-empty sets, each set A equipped with a zero-place 
function that picks out a distinguished object *a in the set, 

arrows: for A, B among the sets, any (total) function / : A — > B which 
maps the distinguished object *a to the distinguished object *b is an 
arrow. 

As we’ll show later, Pfn and Set* are in a good sense equivalent categories (can 
you already see why we should expect that)? 

Now, picking out a single distinguished object, as in Set*, is about the least 
structure we can impose on a set. We turn next to consider cases of sets with 
rather more structure: 

(5) Recall the usual definition of a monoid (M, -,1 M ). This is a pointed set 
M with a distinguished element 1m, equipped with a two-place ‘multipli- 
cation’ function mapping elements to elements. It is required that (i) this 
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function is associative, i.e. for all elements a, 6, c £ M , (a ■ b) ■ c = a ■ (b ■ c), 
and (ii) the designated element 1 m acts as a unit, i.e. is such that for any 
a € M, 1 m ■ a = a = a ■ 1 m- Then Mon is a category with 

objects: all the monoids, 

arrows: for (M, •, 1m), {N, x, Ijv) among the monoids, any monoid 
homomorphism f:M — > N is an arrow. Recall, f: M — N is a 
monoid homomorphism when it preserves monoidal structure, i.e. for 
any x, y € M, f(x-y) = f(x) x f(y ), and / preserves identity elements, 
he. /(1m) = Iat- 

In this category, the identity arrow on (M,-, 1m) is the identity function 
on the carrier set M (which is trivially a homomorphism from the monoid 
to itself); and composition of arrows is, of course, composition of homo- 
morphisms. 

(6) Mon is just the first of a family of similar algebraic cases, where the objects 
are sets equipped with some functions and the arrows are maps preserving 
that structure. We also have: 

a) Grp, the category of groups (the family of structures we mentioned 
at the very outset - except we now officially think of a group as 
comprising a set of objects equipped with a suitable operation), with 

objects of Grp: all the groups, 

arrows: group homomorphisms (preserving group structure) 

b) Ab, the category with 

objects: all abelian groups, 

arrows: group homomorphisms. 

c) Rng, the category with 

objects: all rings, 

arrows: ring homomorphisms. 

d) Bool, the category with 

objects: all Boolean algebras, 

arrows: structure preserving maps between these algebras. 

And so it goes! 

(7) The category Rel again has naked sets as objects, but this time an arrow 
A — >■ B in Rel is (not a function but) any relation R between A and B. We 
can take this officially to be a triple (A, R, B ), where the graph R C Ax B 
is the set of pairs (a, b) such that aRb. 

The identity arrow on A is then the diagonal relation with the graph 
{(a, a) | a € A}. And S o R : the composition of arrows R: A — > B and 
S : B — » C, is defined by requiring a SoR c if and only if 3b{aRb A bSc). It 
is easily checked that composition is associative. 

So here we have our first example where the arrows in a category are 
not functions. 
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(8) The monoids as objects together with all the monoid homomorphisms as 
arrows form a (very large!) category Mon. But now observe the impor- 
tant fact that any single monoid taken just by itself can be thought of as 
corresponding to a (perhaps very small) category. 

Thus take the monoid (M, -,1 m)- Define 46 to be the corresponding 
category with data 

i. the sole object of 46 \ just some object - choose whatever you like, 
and dub it V. 

ii. arrows of : the elements of the monoid M. 

iii. the identity arrow 1* of 46 is the identity element 1 m of the monoid 
M. 

iv. the composition too n: * — > * of two arrows m: * — » * and n: * — > * 
(those arrows being just the elements to, n € M), is to- • n. 

It is trivial that the axioms for being a category are satisfied. So we can 
think of a monoid as a one-object category. (Conversely, then, we can think 
of categories as, in a sense, generalized monoids.) 

Note in this case, unless the elements of the original monoid M are 
themselves functions, the arrows of the associated category 46 are again 
not themselves functions or morphisms in any ordinary sense. 

(9) Ord is a category, with 

objects: the pre-ordered sets. Recall, a set S is pre-ordered iff equipped 
with an order =<: where for all x, y € S, x =4 x, and x =4 y Ay =4 z — > 
x =4 z. We represent the resulting pre-ordered set (S', =^). 
arrows: monotone maps - i.e. maps / : S — > T, from the carrier set of 
(S, =4) to the carrier set of (T, C), such that if x =4 y then f(x) C f(y)- 

(10) Note too that any single pre-ordered set can also itself be regarded as a 
category (this time, a category with at most one arrow between any two 
objects). For corresponding to the pre-ordered set (S,=4), we will have a 
category S*, where 

i. The objects of are just the members of S. 

ii. An arrow from from source C to target D is just an ordered pair 
of objects (C, D) such that C =4 D. 

iii. 1 c = (C,C). 

iv. Composition is defined by setting ( D , E) o (C, D) = (C, E). 

It is easily checked that this satisfies the identity and associatively axioms. 
Conversely, any category 5^ whose objects form a set S and where there 
is at most one arrow between objects can be regarded as a pre-ordered set 
(S, =4) where for C, D £ S, C =4 D just in case there is an arrow from C to 

d ofy. 

We can thus call a category with at most one arrow between objects a 
pre-order category. 
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(11) A closely related case to Ex. (9): Pos is a category with 

objects: the posets - S' is a poset iff equipped with a partial order 
=$, i.e. with a pre-order which satisfies the additional anti-symmetry 
constraint, i.e. for x, y € S, x y A y =4 x — ► x = y. 

arrows: monotone maps. 

And exactly as each pre-ordered set can be regarded as category, so each 
individual poset can be regarded as a category, a poset category. 

(12) Top is a category with 

objects: all the topological spaces, 
arrows: the continuous maps. 

(13) Met is a category with 

objects: metric spaces, which we can take to be a set of points S 
equipped with a real metric d, 

arrows: the non-expansive maps, where - in an obvious shorthand no- 
tation - / : (S,d) — ► (T, e) is non-expansive iff d(x, y) > e(f(x), f(y))- 

(14) Vectk is a category with 

objects: vector spaces over the field k (each such space is a set of 
vectors, equipped with vector addition and multiplication by scalars 
in the field k), 

arrows: linear maps between the spaces. 

For those who already know about such beasts as e.g. sheaves, schemes, or simpli- 
cial sets, there are categories of those too, in each case with the relevant objects 
equipped with predictable structure-preserving maps as arrows. But we won’t 
pause over such exotica: instead let’s finish with a few much simpler cases: 

(15) For any collection of objects M, there is a discrete category on those ob- 
jects. This is the category whose objects are just the members of M, and 
which has as few arrows as possible, i.e. just the identity arrow for each 
object in M. 

(16) The smallest discrete category is 1 which has exactly one object and one 
arrow (the identity arrow on that object). Let’s picture it in all its glory! 


But should we talk about the category 1? Won’t different choices of 
object make for different one-object categories? Well, yes and no! We 
can have, in our mathematical universe, different cases of single objects 
equipped with an identity arrow - but they will be indiscernible from within 
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Categories defined 


category theory. So as far as category theory is concerned, they are all ‘the 
same’. 

Compare a familiar sort of case from elsewhere in mathematics. There 
will be many concrete groups which have the right structure to be e.g. a 
Klein four-group. But they are group-theoretically indiscernible by virtue 
of being isomorphic. So we take them, for many purposes, to be ‘the same’ 
and talk of the Klein four-group. (We’ll say more about this shortly.) 

(17) And having mentioned the one-object category 1 here’s another very small 
category, this time with two objects, the necessary identity arrows, and 
one further arrow between them. We can picture it like this: 

C ± • » * p> 

Call this category 2. We can think of the von Neumann ordinal 2, i.e. 
the set {0, {0}}, as giving rise to this category when it is considered as a 
poset with the subset relation as the ordering relation. Other von Neumann 
ordinals, finite and infinite, similarly give rise to other poset categories. 

And that will do for the moment as an introductory list. There is no shortage 
of categories, then! 

Indeed we might well begin to wonder whether it is just too easy to be a 
category. If such very different sorts of structures as a particular poset on the 
one hand and the whole universe of topological spaces on the other hand equally 
count as categories, how much mileage can there be general theorizing about 
categories and their interrelations? Well, that’s exactly what we hope to see 
over the coming chapters. 

2.3 Diagrams 

We can graphically represent categories - and in particular, represent facts about 
the equality of arrows - in a very natural way, using so-called commutative dia- 
grams. We’ve just seen a couple of trivial mini-examples. We’ll be using diagrams 
a great deal: so we’d better say something about them straight away. 

Talk of diagrams is in fact used by category theorists in three different (but 
very closely related) ways. For the moment, we’ll mention two. 

Definition 2. A representational diagram is a ‘graph’ with nodes representing 
objects from a given category and drawn arrows between nodes representing 
arrows of . Nodes and drawn arrows are usually labelled. 

Two nodes in a diagram can be joined by zero, one or more drawn arrows. A 
drawn arrow labelled ‘/’ from the node labeled ‘A’ to the node labeled ‘B’ of 
course represents the arrow /: A — ► B of . There can also be arrows looping 
from a node to itself, representing the identity arrow on an object or some other 
‘endomorphism’ (i.e. other arrow whose source and target is the same). <1 
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2.3 Diagrams 


Definition 3. A diagram in a category Y? is what is represented by a represen- 
tational diagram - i.e. is some ^-objects and ^-arrows between them. <1 

I’m being a little pernickety in distinguishing the two ideas here, the diagram-as- 
picture, and the diagram-as-what-is-pictured. But having made the distinction, 
we will rarely need to fuss about it, and can let context determine a sensible 
reading of claims about diagrams. 

An important point is that diagrams (in either sense) needn’t be full. That is 
to say, a diagram-as-a-picture need only show some of the objects and arrows 
in a category; and a diagram-as-what-is-pictured need only be a fragment of the 
category in question. 

Now, within a drawn diagram, we may be able to follow a directed path 
through more than two nodes, walking along the connecting drawn arrows (from 
source to target, of course). So a path in a category diagram from node A to 
node E (for example) might look like this 

A^B^-C-^AD^E 

And then we’ll call the composite arrow j o h o g o f the composite along the 
path. (We know that the composite must exist, and also that because of the 
associativity of composition we needn’t worry about the order of bracketing here. 
Indeed, henceforth we freely insert or omit brackets, doing whatever promotes 
local clarity.) Then: 

Definition 4. A category diagram commutes if (i) for any two directed paths 
along edges in the diagram from a node X to the distinct node Y, the composite 
arrow along the first path is equal to the composite arrow along the second path, 
and (ii) for any closed path with more than one node that loops around from a 
node X to itself, the composite arrow along that path is equal to the identity 
arrow on the object at X. < 

The main clause here is (i): it is useful to add (ii) to cover a few additional 
special cases. 

Hence, for example, the associativity law can be represented by saying that 
the following diagram commutes: 



Each triangle commutes by definition; and the commutativity axiom amounts 
then to the claim that we can paste such triangles together to get a larger 
commutative diagram. 

(Note, however, that just drawing a diagram with different routes from e.g. A 
to D doesn’t always mean that we have a commutative diagram - the equality 
of the composites along the paths in each case has to be argued for!) 
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3 Categories beget categories 


We have already seen that categories are plentiful. This chapter describes a 
number of methods for constructing new categories from old. (We’ll meet further 
methods later, but these first ones will be enough to be going on with.) 

3.1 Subcategories 

(a) Let’s start with a particularly simple way of getting a new category just 
slim down an old one. 

Definition 5. Given a category if 5? consists of the data 

(1) Objects: some or all of the ^-objects, 

(2) Arrows: some or all of the ^-arrows, 

subject to the conditions 

(3) for each ^-object C, the 'if- arrow Iq is also an J^-arrow, 

(4) for any ^-arrows / : C — ► D, g: D — > E, the ^-arrow g o / : C — > E is 
also an ,5^-arrow, 

then, with composition of arrows in S? defined as in the original category 5? 
is a subcategory of ^ . <1 

Plainly, the conditions in the definition - containing identity arrows for the 
remaining objects and being closed under composition - are there to ensure that 
the slimmed-down 5? is indeed still a category. 

Obviously, some cases where we prune an existing categories will leave us 
with unnatural constructions of no particular interest. Other cases can be more 
significant, and indeed we have already met some examples: 

(1) Set is a subcategory of Pfn, 

(2) FinSet is a subcategory of Set, 

(3) Ab is a subcategory of Grp, 

(4) The discrete category on the objects of ^ is a subcategory of ^ for any 
category. 
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3.2 Duality 


(b) As we have seen, then, we can shed objects and/or arrows in moving from 
a category to a subcategory. Three of our examples are cases where we keep all 
the objects but shed some or all of the non-identity arrows. But cases (2) and 
(3) are ones where we drop some objects while keeping all the arrows between 
those objects retained in the subcategory, and there is a standard label for such 
cases: 

Definition 6. If 5A is a subcategory of ^ where, for all J^-objects A and B , the 
J^-arrows from A to B are all the ^-arrows from A to B, then SA is said to be 
a full subcategory of ( to. <3 

We’ll meet more cases of full subcategories later. 

3.2 Duality 

Another easy, but particularly important, way of getting one category from an- 
other is to reverse all the arrows. More carefully: 

Definition 7. Given a category then its opposite or dual < A? op is the category 
with the data 

(1) The objects of ( €° v are just the objects of ^ again. 

(2) If / is an arrow of ^ with source A and target B , then / is also an arrow 
of c to op but now it is assigned source B and target A. 

(3) Identity arrows remain the same, i.e. 1/f = 1 a- 

(4) Composition-in-‘if op is defined in terms of composition-in-^ by putting 

f o°P g = go f. <3 

It is trivial to check that this definition is in good order and that to op is indeed 
a category. And it is trivial to check that (fA’ op ) op is So every category is the 
opposite of some category. 

Do be careful here, however. Take for example Set op . An arrow / : A — ► B 
in Set op is the same thing as an arrow f: B — > A in Set, which is of course 
a set-function from B to A. But this means that /: A — > B in Set op typically 
won’t be a function from its source to its target - it’s an arrow in that direction 
but usually only a function in the opposite one! (This is one of those cases 
where talking of ‘domains’ and ‘codomains’ instead of ‘sources’ and ‘targets’ 
could initially encourage confusion, since the domain of an arrow in Set op is its 
codomain as a function.) 

Take JA to be the elementary pure language of categories. This will be a 
two-sorted first-order language with identity, with one sort of variable for ob- 
jects, A,B,C . and another sort for arrows /, g,h, . . .. It has built-in function- 
expressions ‘src’ and ‘ tar ’ (denoting two maps from arrows to objects), a built-in 
relation ‘. . . is the identity arrow for . . . ’, and a two place function-expression 
‘. . . o . . . ’ which expresses the function which takes two composable arrows to 
another arrow. 
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Definition 8. Suppose p is a wff of Jz? . Then its dual p op is the wff you get by 
swapping (i) ‘src’ and ‘tar’ and swapping (ii) ‘/ o g' for ‘<7 o /’, etc. <1 

Now, the claim that ( tf op is a category just reflects the fact that the duals of 
the axioms for a category are also axioms. And that observation gives us the 
following duality principle: 

Theorem 2. Suppose p is an Jz? -sentence (a wff with no free variables) - so 
p is a general claim about objects /arrows in an arbitrary category. Then if the 
axioms of category theory entail p, they also entail the dual claim p° v . 

Since we are dealing with a first-order theory, syntactic and semantic entailment 
come to the same, and we can prove the theorem either way: 

Syntactic proof. If there’s a first-order proof of p from the axioms of category 
theory, then by taking the duals of every wff in the proof we’ll get a proof of 
p op from the duals of the axioms of category theory. But those duals of axioms 
are themselves axioms, so we have a proof of p op from the axioms of category 
theory. □ 

Semantic proof. If p always holds, i.e. holds in every category , then p op will 
hold in every e to op - but the < ^’ op s comprise every category again, so p op also 
holds in every category. □ 

The duality principle is very simple but also a hugely labour-saving result; 
we’ll see this time and time again, starting in the next chapter. 


3.3 Arrow categories and slice categories 

(a) For the moment, we will mention just two more ways of deriving a new 
category from a given category %? . The first again reinforces the point that the 
objects of category need not be really object-like: 

Definition 9. Given a category the derived arrow category has the fol- 
lowing data: 

(1) objects, its first sort of data, are simply the arrows of Y, 

(2) Given ‘^“’'-objects /i ,/2 (i.e. ^-arrows fi : X\ — > Yj, f 2 : X 2 — > Y 2 ), a 
Y^-axvow a : f± — > f 2 is a pair (j, k) of ^-arrows such that the following 
diagram commutes: 


X! - 
h 
Y\ - 


> X 2 
h 

-> >"2 
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3.3 Arrow categories and slice categories 


The identity arrow on /: X — ► Y is defined to be the pair (lx, 1 y)- And com- 
position of arrows (j, k): fi —> and (f, k') : fi — > fs is then defined in the 
obvious way to be ( j ' o j, k' o k) : f± — > fz (just think of pasting together two of 
those commuting squares). <1 

It is easily checked that this does indeed define a category. We’ll give examples 
of naturally arising arrow categories later. 


(b) Suppose next that ^ is a category, and I a particular ^-object. We will 
define a new category from this time the so-called ‘slice’ category tf/I where 
- as in an arrow category - the new category’s objects are again (some of) the 
original category’s arrows. 

Her’s the definition (and to keep things clear but brief, let’s use ‘arrowy’ to 
refer to the old arrows and reserve plain ‘arrow’ for the new arrows to be found 
in tf/I): 

Definition 10. Let be a category, and I be a ^-object. Then the new category 
ff/I, the slice category over /, has the following data: 

(1) The ‘if //-objects are all the arrows^ f : A — > /, for any ^-object A. 

(2) For ^//-objects the associated //-arrows from / 

to g are just the arrows^ j : A — >• B such that g o j = / in . 

(3) For every ^//-object f:A—>I, there is an identity arrow, namely the 
arrowy I 4 : A — > A. 

(4) For any ‘if //-objects f:A—>I,g:B—¥l,h:C—>I, the composition of 

j : f — » g and k : g — > h is defined as follows: k o j : / — > h is the arrowy 
k o j : A — »• C. <\ 


Of course, we need to check that these data do indeed together 
satisfy the axioms for constituting a category. So let’s do that. 

Diagrams will help. Take the arrows^ /: A B — ► I. 

They will be objects /, g of ^/I. And the arrows of to /I from 
f to g will be the arrows^ like j : A — > B which make our first 
diagram commute. (Note: the domain and co-domain of j as an 
arrowy are respectively A , B. But the domain and co-domain of j as an arrow 
in the slice category ‘tf/I are respectively / and g.) 

We now need to confirm that our definition of koj for cornpos- 
a ing ( to /I- arrows works. We are given in ^ that j : A — ► B is such 




that go j = /, and also that ho k = g. Putting things together 
we get the second commutative diagram. 

Or in equations, we have (h o k) o j = f in and therefore 
h o [k o j) = f. So ( k o j) does indeed count as an arrow in ^ /I 
from f to h, as we require. 

The remaining checks to confirm ( ra // satisfies the axioms for 


C 


being a category are then trivial. 
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Categories beget categories 


(c) There’s a dual notion we can define here, namely the idea of a co-slice 
category. This category //‘if has as objects the arrows^ /: / — > A, for any 
^-object A. And the rest of the definition is as you would predict given our 
explanation of duality: just go through the definition a slice category reversing 
arrows and the order of composition. (Check that this works!) 

(d) Here are a couple of quick examples of slice and co-slice categories, one of 
each kind: 

(1) Pick a singleton set ‘1’. We have mentioned before the idea that we can 
think of any element a; of X as an arrow x : 1 — > X . 

So now think about the co-slice category 1/Set. Its objects are all the 
morphisms x: 1 — > X (for any set X , one morphism for every point in 
x e X). 

We can think of each such morphism as in effect assigning a set X a 
special ‘basepoint’ x. Thought of like that, each object in 1/Set encodes 
a pointed set. And the arrows in 1/Set from some x: 1 — > X to some 
y: 1 — > Y (in effect, from X-with-basepoint- 2 : to X-with-basepoint-y) are 
all the maps / : X — > Y in Set such that f ox = y: so we can think of such 
maps as the maps which preserve basepoints. 

Hence we can intuitively think of 1/Set as being, in some strong sense, 
‘the same as’ the category Set* of pointed sets (we’ll return later to explain 
what ‘being the same as’ comes to for categories). 

(2) Second, take an n-membered index set I n = {ci, C 2 , C 3 , . . . , c n }. Think of 
the members of the set as ‘colours’. Then a morphism S — > I n is an n- 
colouring of the set S. So we can think of FinSet//„ as the category of n- 
coloured finite sets, which is exactly the sort of thing that combinatorialists 
are interested in. 

More generally, we can think of a slice category Set// as a category of 
‘indexed’ or ‘typed’ sets, with / providing the indices/types. 
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4 Kinds of arrows 


This chapter characterizes a number of kinds of arrows in terms of how they 
interact with other arrows. This will give us some elementary but characteristic 
examples of categorial, arrow-theoretic, (re) definitions of familiar notions. 

4.1 Monomorphisms, epimorphisms 

(a) Take a set-function / : A —> B living as an arrow in Set: how could we say 
that it is injective, i.e. one-one, using just category-speak about arrows? 

We noted that we can think of elements x of /’ s domain A as arrows x: 1 — > A 
(where 1 is some singleton). Injectiveness then comes to this: fox = foy implies 
x = y , for any element-arrows x, y. Hence if a function is more generally ‘left- 
cancellable’ in Set meaning that, for any g,h 1 f o g = f o h implies g = h - 
then it certainly has to be an injection. 

Conversely, if / is injective as a set-function, then for all x, f(g(x)) = f{h{x)) 
implies g{x) = h(x) - which is to say that if / o g = f o h then g = h, i.e. / is 
left-cancellable. 

So that motivates introducing a notion with the following definition (the new- 
fangled terminology comes from abstract algebra): 

Definition 11. An arrow /: C — > D in the category ^ is a monomorphism (is 
monic ) if and only if it is left-cancellable, i.e. for every pair of maps g: B — » C 
and h: B — > C , if / o g = / o h then g = h. <1 

We have just proved 

Theorem 3. The monomorphisms in Set are exactly the injective functions. 

The same applies in many, but not all, other categories where arrows are 
functions. For example, we have: 

Theorem 4. The monomorphisms in Grp are exactly the injective group homo- 
morphisms. 

Proof. We can easily show as before that the injective group homomorphisms 
are monomorphisms in Grp. 
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For the other direction, suppose /: C — > D is a group homorphism between 
the groups ( C,-,ec ) and (£>,*, en) but is not an injection. 

We must have /(c) = f(c') for some c,c' G C where c yl c' . Note then that 

/(c _1 • c') = /(c- 1 ) * /(c') = /(c- 1 ) * /(c) = /(c" 1 • c) = f(e c ) = e D . 

So c _1 • d is an element in K C C, the kernel of / (the set of elements that / 
sends to the unit of (D,*, eu)). Since c ^ c' , we have c -1 • c' ^ ec, and hence 
A" has more than one element. 

Now define g : K — > C to be the obvious inclusion map (which send an element 
of K to the same element of C), while h: K — ► C sends everything to ec- Since 
K has more than one element, g ^ h. But obviously, / ° g = f ° h (both send 
everything in K to ec)- So / isn’t left-cancellable. 

Hence, contraposing, if / is monic in Grp it is injective. □ 

(b) Next, here is a companion definition: 

Definition 12. An arrow /: C — > D in the category c to is an epimorphism (is 
epic) if and only if it is right-cancellable, i.e. for every pair of maps g: D — > E 
and h: D — > E, if go f = ho / then g — h. <1 

Evidently, the notion of an epimorphism is dual to that of a nronomorphism. 
Hence / is right-cancellable and so epic in ^ if and only if it is left-cancellable 
and hence monic in ( d’ op . And, again predictably, just as monomorphisms in the 
category Set are injective functions, we have: 

Theorem 5. The epimorphisms in Set are exactly the surjective functions. 

Proof. Suppose / : C — > D is surjective. And consider two functions g,h: D -* E 
where g ^ h. Then for some d G D } g(d) ^ h(d). But by surjectivity, d = /(c) 
for some c € C. So g(f(c)) ^ h(f(c)), whence go f ^ ho f. So contraposing, the 
surjectivity of / in Set implies that if g o / = h o /, then g = h, i.e. / is epic. 

Conversely, suppose /: C — > D is not surjective, so f[C] ^ D. Consider two 
functions g: D — » E and h: D — ► E which agree on f[C] C D but disagree 
on the rest of D. Then g ^ h, even though by hypothesis g o f and h o f will 
agree everywhere on C, so / is not epic. Contraposing, if / is epic in Set, it is 
surjective. □ 

A similar result holds in many other categories, but in §4.3, Ex. (2), we’ll en- 
counter a case where we have an epic function which is not surjective. 

As the very gentlest of exercises, let’s add for the record a mini-theorem: 

Theorem 6. (1) Identity arrows are always monic. Dually, they are always 

epic too. 

(2) If f , g are monic, so is f o g. If f , g are epic, so is f o g. 

(3) If f o g is monic, so is g. If f o g is epic, so is f. 


20 



4.2 Inverses 


Proof. (1) is trivial. 

Next, we need to show that if (/ o g) o j = (/ o g) o k, then j = k. So suppose 
the antecedent. By associativity, / ° (<? ° j) = f o (g o k). Whence, assuming / 
is monic, g o j = g o k. Whence, assuming g is monic, j = k. Which establishes 
that if / and g are monic, so is (/ o g). 

Interchanging / and g , if / and g are monic, so is (go f): applying the duality 
principle it follows that / and g are epic, so is (fog). 

For (3) assume fog is monic. Suppose goj = gok. Then fo(goj) = fo(gok), 
and hence (/ o g) o j = (/ o g) o k, so j = k. Therefore if g o j = gok then j = k; 
i.e. g is monic. Dually again for epics. □ 

(c) We should note a common convention of using special arrows in repre- 
sentational diagrams, a convention which we will follow occasionally but not 
religiously: 

f 

f : C >—* D or C > > D represents a monomorphism /, 

/ : C -» D or C » D represents an epimorphism. 

As a useful mnemonic (well, it works for me!), just think of the alphabetic 
ordering (ML) a monomorphism is le ft cancellable and its representing arrow 
has an extra fletch on the Zeft, while (PR) an epimorphism is right cancellable 
and its representing arrow has an extra fletch on the right. 

4.2 Inverses 

(a) We next define some more types of arrow: 

Definition 13. Given an arrow /: C — > D in the category 

(1) g: D — > C is a right inverse of / iff / o g = l]j. 

(2) g: D — > C is a left inverse of / iff g o f = l c . 

(3) g : D — ► C is an inverse of / iff it is both a right inverse and a left inverse 

of/. < 

Three remarks. First, on the use of ‘left’ and ‘right’. Note that if we represent 
the situation in (1) like this 

D » C » D 



1 D 


then /’ s right inverse g appears on the left! It is just a matter of convention that 
we standardly describe handedness by reference to the representation 1 fog = \ D ' 
rather than by reference to our diagram. (Similarly, of course, in defining left- 
cancellability, etc.) 
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Second, note that g o / = 1q in ^ iff f o op g = l c in ( to op . So a left inverse in 
if is a right inverse in c €° v . And vice versa. The ideas of a right inverse and left 
inverse are therefore, exactly as you would expect, dual to each other; and the 
idea of an inverse is dual to itself. 

Third, if / has a right inverse g , then it is itself a left inverse (of g, of course!). 
Dually, if / has a left inverse, then it is a right inverse. 

It is obvious that an arrow / need not have a left inverse: just consider, for 
example, those arrows in Set which are many-one functions. An arrow / can also 
have many left inverses: for a toy example in Set again, consider /: {0} — > {0, 1} 
where /( 0) = 0. Then the map g: {0,1} — > {0} is a left inverse so long as 
g{ 0) = 0, which leaves us two choices for <7(1), and hence we have two left 
inverses. 

By the duality principle, an arrow can also have zero or many right inverses. 
However, 

Theorem 7. If an arrow has both a right inverse and a left inverse, then these 
are the same and are the arrow’s unique inverse. 

Proof. Suppose / : G — >■ D has right inverse r-.D^C and left inverse s : D — » 
G. Then 

r = l c or=(so/)or = so(/or) = sol D = s. 

Or, in a commuting diagram, 


r 



s 


Hence r = s and r is an inverse. 

Suppose now that / has inverses r and s. By definition r will be a right inverse 
and s a left inverse, so as before r = s. So in fact inverses are unique. □ 

(b) By way of an aside, let’s remark that just as we can consider a particular 
monoid as a category, in the same way we can consider a particular group as a 
category. Take a group (G, -, e) and define to be the corresponding category 
whose sole object is whatever you like, and whose arrows are the elements g of 
G, with e the identity arrow. Composition of arrows in Sf is defined as group- 
multiplication of elements in G. And since every element in the group has an 
inverse, it follows immediately that every arrow in the corresponding category 
has an inverse. So in sum, a group-as-a-category is a category with one object and 
whose every arrow has an inverse. (And generalizing, a category with perhaps 
more than one object but whose arrows all still have inverses is called a groupoid.) 
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(c) Now, how does talk of an arrow as a right inverse/left inverse hook up to 
talk of an arrow as monic/epic? 

Theorem 8. (1) In general, not every monomorphism is a right inverse; and 

dually, not every epimorphism is a left inverse. 

(2) But every right inverse is monic, and every left inverse is epic. 

Proof. (1) can be shown by a toy example. Take the category 2 which we met 
back in §2.2, Ex. (17) - i.e. take as a category the two-elenrent pre-ordered set 
which has just one non-identity arrow. That non-identity arrow is trivially monic 
and epic, but it lacks both a left and a right inverse. 

For (2), suppose / is a right inverse for e, which means that eof = 1 (suppress- 
ing unnecessary labellings of domains and codomains). Now suppose fog = f oh. 
Then eo f o g = eo f oh, and hence 1 o g = loft,, i.e. g = ft, so indeed / is monic. 
Similarly for the dual. □ 

So monies need not in general be right inverses nor epics left inverses. But how 
do things pan out in the particular case of the category Set? Here’s the answer: 

Theorem 9. In Set, every monomorphism is a right inverse apart from arrows 
of the form 0 — ► D. Also in Set, the proposition that every epimorphism is a left 
inverse is (a version of) the Axiom of Choice. 

Proof. Suppose f : C — > D in Set is monic. It is therefore one-to-one between C 
and f [C] , so consider a function g: D — » C that reverses / on f [C] and somehow 
or other maps D — f[C] into C. Such a g is always possible to find in Set unless 
C is the empty set. So g o f = 1 c, and hence / is a right inverse. 

Now suppose / : C — > D in Set is epic, and hence a surjection. Assuming the 
Axiom of Choice, there will be a function g: D — >■ C which maps each d £ D to 
some chosen one of the elements c such that /(c) = d. And given such a function 
g, f ° 9 = Id, so / is a left inverse. 

Conversely, suppose we have a partition of C into disjoint subsets indexed 
by (exactly) the elements of D. Let f : C — > D be the function which sends an 
object in C to the index of the partition it belongs to. / is surjective, hence epic. 
Suppose / is also a left inverse, so for some g : D — > C, f o g = 1 D . Then g is 
evidently a choice function, picking out one member of each partition. So the 
claim that all epics have a left inverse gives us (one version of) the Axiom of 
Choice. □ 

(d) There is an oversupply of other jargon hereabouts, also in pretty common 
use. We should note the alternatives for the record. 

Assume we have a pair of arrows in opposite directions, / : C — > D, and 
g- D C . 

Definition 14. If g o f = 1 q, then / is also called a section of g , and g is a 
retraction of /. (In this usage, f is a section iff it has a retraction, etc.) <1 
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Definition 15. If / has a left inverse, then / is a split monomorphism ; if g has a 
right inverse, then g is a split epimorphism. (In this usage, we can say e.g. that 
the claim that every epimorphism splits in Set is the categorial version of the 
Axiom of Choice.) <1 

Note that Theorem 8 tells us that right inverses are monic, so a split monomor- 
phisnr is indeed properly called a monomorphism. Dually, a split epimorphism 
is an epimorphism. 

4.3 Isomorphisms 

(a) We are familiar, before we ever encounter category theory, with the notion 
of an isomorphism between structured sets (between groups, between topolog- 
ical spaces, whatever): it’s a bisection between the sets which preserves all the 
structure. In the extremal case, in the category Set of sets with no additional 
structure, the arrows which are both monic and epic provide the bijective func- 
tions. Can we generalize from this case and define the isomorphisms of any 
category to be arrows which are monic and epic there? 

No. Isomorphisms properly so called need to have inverses (we crucially want 
being isomorphic to be an equivalence relation, and hence in particular to be 
symmetric). But being monic and epic doesn’t always imply having an inverse. 
We can use again the toy case of 2, or here’s a generalized version of the same 
idea: 

(1) Take the category S* corresponding to the pre-ordered set (S,=4). Then 
there is at most one arrow between any given objects of S fi . But if / o g = 
/o/i, then g and h must share the same object as domain and same object 
as codomain, hence g = h, so / is monic. Similarly it is epic. But no arrows 
other than identities have inverses. 

The arrows in that example aren’t functions, however. So here’s a revealing case 
where the arrows are functions but where being monic and epic still doesn’t 
imply having an inverse: 

(2) Consider the category Mon of monoids. Among its objects are N = (N, +, 0) 
and Z = (Z, +, 0) - i.e. the monoid of natural numbers equipped with ad- 
dition and the monoid of positive and negative integers equipped with 
addition. 

Consider the natural ‘inclusion’ map i: N — * Z which sends a natural 
number to the corresponding positive integer (this is not a true inclusion 
map if we take it that naturals in N are a different type of thing to integers 
in Z). This map obviously does not have an inverse in Mon. We can show, 
however, that it is both monic and epic. 

First, suppose M = (M. •, 1 m) is some monoid and we have two arrows 
g,h: M — > N, where g ^ h. So there is some element m £ M such that 
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the natural numbers g(m) and h(rn) are different, which means that the 
corresponding integers i(g(m)) and are different, so i o g i o h. 

Contraposing, this means i is monic in the category. 

Second, again take a monoid M and this time consider any two monoid 
homomorphisms g,h: Z — > M such that g o i = h o i. Then g and h must 
agree on all integers from zero up. But then note 

S(-1) = 9 (~ !) ' !m = g{~ 1) ' h{ 0) = g(- 1) • h( 1 + -1) 

= g{- 1) • M 1 ) • H- 1 ) = gh 1 ) ■ s(i) • K~ l ) 

= g (- 1 + 1) • h(-l) = g( 0) • ft(-l) = 1 M ■ h(- 1) = ft(-l). 
But if g(— 1) = ft(— 1), then 

5(-2) = g(- 1+-1) = fl(-l)-5(-l) = = /i(-l+-l) = *■(— 2 ),. 

and so it goes. So we have g(j) = h(j) for all j G Z, positive and negative, 
and hence g = h and i is right-cancellable, i.e. epic. 

So in sum: we can’t define an isomorphism as an epic monic if isomorphisms are 
to have the essential feature of invertibility. 

(b) What to do? Build in that feature from the start, and say: 

Definition 16. An isomorphism (in category ‘if) is an arrow which has an inverse. 
We conventionally represent isomorphisms by decorated arrows, thus: < 

From what we have already seen, we know or can immediately check that 

Theorem 10. (1) Identity arrows are isomorphisms. 

(2) An isomorphism f: C D has a unique inverse which we can call 
f i • D ^ C, such that f- 1 o / = l c , f o f - 1 = 1 D , (/- 1 )- 1 = f, 
and f^ 1 is also an isomorphism. 

(3) If f an d g are isomorphisms, then g o f is an isomorphism if it exists, 
whose inverse will be / -1 o g~ x . 

So let’s immediately give some simple examples of isomorphisms in different 
categories: 

(1) In Set, the isomorphisms are the bijective set-functions. 

(2) In Grp, the isomorphisms are the bijective group homomorphisms. 

(3) In Vectk, the isomorphisms are invertible linear maps. 

(4) In a group treated as a category, every arrow is an isomorphism. 

(5) But as we noted, in a pre-order category, the only isomorphisms are the 
identity arrows. 
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(c) Isomorphisms are rnonic and epic by Theorem 8: but we now know that 
arrows which are rnonic and epic need not be isomorphisms as we have just 
defined them. However, it is worth remarking that we do have this: 

Theorem 11. If f is both monic and split epic (or both epic and split rnonic), 
then f is an isomorphism. 

Proof. If / is a split epimorphism, it has a right inverse, i.e. there is a g such 
that / o g = 1 . Then (/ o g) o f = /, whence / o (g o f) = / o 1 . Hence, given that 
/ is also mono, g o f = 1 , and g is both a left and right inverse for /, i.e. / has 
an inverse. Dually for the other half of the theorem. □ 

And we’ll also mention another easy result: 

Theorem 12. If f and g are both monic, and each factors through the other, 
i.e. there is an i such that f = g o i and there a j such that g = f o j, then the 
factors i and j are isomorphisms. 

In other words, if each of the triangles in the following diagram commutes, then 
so does the whole diagram: 



Proof. We have / o \ x = f = goi = f o j oi. Hence, since / is monic, j o i = l x . 
Similarly, i o j = 1 Y . So i and j are each others two-sided inverse, and both are 
isomorphisms. □ 


4.4 Isomorphic objects 

Definition 17. If there is an isomorphism f: C D in then the objects 
C , D are said to be isomorphic in < ^’, and we write C = D. <3 

From the ingredients of Theorem 10, we immediately get the desirable result 
that 

Theorem 13. Isomorphism between objects in a category is an equivalence rela- 
tion. 

Now, we might wonder how far this notion of isomorphism between objects 
captures the idea of two objects amounting to the same as far as their ambient 
category is concerned. We mentioned before the example where we have, living 
in Grp, lots of instances of a Klein four-group which are group-theoretically 
indiscernible by virtue of being isomorphic (indeed, between any two instances, 
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there is a unique isomorphism). And yes, we then cheerfully talk about the Klein 
four-group. 

There is a real question, however, about just what this way of talking amounts 
to, when we seemingly identify isomorphic objects. Some claim, indeed, that cat- 
egory theory itself throws a lot of light on this very issue (see e.g. Mazur 2008). 
Well certainly, category theory typically doesn’t care about distinguishing iso- 
morphic objects in a category. But note that it would, for example, initially 
strike us as odd to say that because all the instances of singleton sets are iso- 
morphic (indeed, between any two instances, there is a unique isomorphism) we 
can talk about the singleton. After all, the pairwise distinctness of all the sin- 
gletons {0}, {{0}}, {{{0}}}, {{{{0}}}}; • • ■ is precisely what allows us to use this 
sequence of distinct sets as one possible construction (Zermelo’s) for the natural 
numbers. 

But we can’t delay to explore this issue any further at the moment: we are 
just flagging up that there are issues we’ll at some point need to discuss around 
and about the idea of isomorphism-as-sameness. 
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Our definition of an isomorphism characterizes a type of arrow not by, so to 
speak, its internal workings, but by reference to its interaction with another 
arrow, its inverse. This is entirely typical of a category-theoretic (re)definition 
of a familiar notion: we look for similarly external, relational, characterizations 
of arrows and/or structured objects. 

Here’s Awodey, offering some similarly arm-waving 

. . . remarks about category-theoretical definitions. By this I mean 
characterizations of properties of objects and arrows in a category in 
terms of other objects and arrows only, that is, in the language of 
category theory. Such definitions may be said to be abstract, struc- 
tural, operational, relational, or external (as opposed to internal). 

The idea is that objects and arrows are determined by the role they 
play in the category via their relations to other objects and arrows, 
that is, by their position in a structure and not by what they ‘are’ 
or ‘are made of’ in some absolute sense. (Awodey, 2006, p. 25) 

Over the next few chapters, then, we proceed to give further such external 
category-theoretic definitions of familiar notions. A prime exhibit will be the 
treatment of products, starting in the next chapter. In this chapter, we warm 
up by considering a particularly simple pair of cases. 

5.1 Initial and terminal objects, definitions and examples 

Definition 18. The object I is an initial object of the category c ta iff, for every 
^-object X, there is a unique arrow I — >• X. 

Dually, the object T is a terminal object of c /a iff, for every ^-object X, there 
is a unique arrow X — > T. 1 

It is common to use the likes of ‘! : / — > X’ or ‘! : X — > T’ for the unique arrow 
from an initial object or to a terminal object. <1 

Some examples: 

1 Warning: some call terminal objects final ; and then that frees up ‘terminal’ to mean initial 
or final. 
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(1) In the poset (N, thought of as a category, zero is trivially the unique 
initial object and there is no terminal object. The poset (Z, has neither 
initial nor terminal objects. 

(2) More generally, a poset-(S, =^)-treated-as-a-category has an initial object 
iff the poset has a minimum, an object which ^-precedes all the others. 
Dually for terminal objects/maxima. 

(3) In Set, the empty set is an initial object (cf. the comment on Ex. 1 in §2.2). 

And any singleton set {*} is a terminal object. (For if X has members, 
there’s a unique Set-arrow which sends all the members to ★; while if X is 
empty, then there’s a unique Set-arrow to any set, including {★}). 

(4) In Set* - the category of pointed sets, non-empty sets equipped with a 
distinguished member - each singleton is both initial and terminal. (A 
singleton’s only member has to be the distinguished member in a pointed 
set. Arrows in Set* are functions which map distinguished elements to 
distinguished element. Hence there can be one and only one arrow from a 
singleton to some some other pointed set.) 

(5) In Pos, the category of posets, the empty poset is initial, and any singleton 
equipped with the only possible order relation on it (the identity relation!) 
is terminal. 

(6) In Rel, the category of sets and relations, the empty set is both the sole 
initial and sole terminal object. 

(7) In Top, the empty set (considered as a trivial topological space) is the 
initial object. Any one-point singleton space is a terminal object. 

(8) In Grp, the trivial one-elenrent group is an initial object (a group has to 
have at least one object, the identity; now recall that a group homomor- 
phism sends identity elements to identity elements; so there is one and 
only one homomorphism from the trivial group to any given group G). 
The same one-element group is also terminal. 

(9) In the category Bool, the trivial one-object algebra is terminal. While the 
two-object algebra on {0, 1} familiar from propositional logic is initial 
for a homomorphism of Boolean algebras from {0, 1} to B must send 0 to 
the bottom object of B and 1 to the top object, and there’s a unique map 
that does that. 

(10) Recall: in the slice category ff /X an object is a ^-arrow like f:A—*X l 
and an arrow from f: A — > X to g: B — >■ AT is a < f- arrow j : A — ► B such 
that go j = f in f . Consider the ^/X-object lx ■ X — > X. A fX arrow 
from / : A — > X to lx is a ^-arrow j : A — > X such that 1 x ° j = /, he. 
such that j = f - which exists and is unique! So lx is terminal in ‘if /X. 

Such various cases show that a category may have zero, one or many initial ob- 
jects, and (independently of that) may have zero, one or many terminal objects. 
Further, an object can be both initial and terminal. 
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There is, incidentally, a standard bit of jargon for the last case: 

Definition 19. An object O in the category ^ is a null object of the category ^ 
iff it is both initial and terminal. <1 


5.2 Uniqueness up to unique isomorphism 

A category ^ , to repeat, may have no initial objects, or only one, or have many. 
However, we do have the following key result: 

Theorem 14. Initial objects, when they exist, are ‘unique up to unique isomor- 
phism’: i.e. if the -objects I and J are both initial in the category , then there 
is a unique isomorphism f : I J in . Dually for terminal objects. 

Further, if I is initial and I = J, then J is initial. Dually for terminal objects. 

Proof. Suppose I and J are both initial objects in %f. By definition there must be 
unique ^-arrows /: / — > J, and g: J I. Then go f is an arrow from / to itself. 
Another arrow from I to itself is the identity arrow 1/. But since I is initial, 
there can only be one arrow from / to itself, so g o / = 1/. Likewise / o g = Ij. 
Hence the unique arrow f has a two-sided inverse and is an isomorphism. (Note 
this pattern of argument: we’ll be using it a lot!) 

Now suppose / is initial, and that there is an isomorphism i: I J. Then 
for any X , there is a unique arrow f: I — > X, and hence there is an arrow 
/ o 1 : J — )■ X. Now suppose we also have g : J — »• X. Then g o i : I — ► X, and 
so g o i = /, hence (g o i) o i~ x = / o i -1 , hence g = / o i -1 . In sum, for any X 
there is a unique arrow from J to X , thus J is also initial. 

Duals of these two arguments deliver, of course, the dual results. □ 

It is standard to introduce notation for an arbitrary initial and terminal ob- 
jects (since categorically, we usually don’t care about distinctions among in- 
stances): 

Definition 20. We use ‘0’ to denote an initial object of (assuming one exists), 
and likewise T’ to denote a terminal object. <1 

Note that in Set, 0 is 0, the only initial object - and 0 is also the von Neumann 
ordinal 0. While the von Neumann ordinal 1 is {0}, i.e. a singleton, i.e. a terminal 
object 1. Which perhaps excuses the recycling of the notation. 

By the way, null objects (objects which are both initial and terminal) are often 
alternatively called ‘zero’ objects. But that perhaps doesn’t sit happily with 
using ‘0’ for an initial object: for 0 (in the sense of an initial object) typically 
isn’t a zero (in the sense of null) object. Hence our preference for ‘null’. 


30 



5.3 Elements and generalized elements 


5.3 Elements and generalized elements 

(a) Consider the category Set again. And take a terminal object, a singleton 1. 
Now, an arrow x : 1 — > X is a set-function sending the member of the singleton 1 
to some member x € X. Trivially there is exactly one such arrow for any x € X, 
with different arrows corresponding to different elements. So, as we’ve remarked 
before, in Set we can think of talk of such arrows x: 1 — >■ X as the categorial 
version of talking of elements of X. 

This motivates the following more general definition: 

Definition 21. In a category Y? with a terminal object 1, an element or point of 
the “^-object X is an arrow x: 1 — > X. <1 

(In fact, the standard terminology for such an element is 1 global element’, picking 
up from a paradigm example in topology - but we won’t fuss about that.) 

Theorem 15. Elements x: 1 — > X in a category are monic. 

Proof. Suppose xo f = xog-, then, for the compositions to be defined and equal, 
both f and g must be morphisms Y — > 1, for the same Y. Hence / = g since 1 
is terminal. □ 

Definition 22. Suppose the category Y has a terminal object, and for any object 
X, Y in Y , and arrows /, g : X — > Y, f = g iff for all x : 1 — > X, f o x = g o x. 
Then Y is said to be well-pointed. <\ 

Think of it this way: in a well-pointed category, there are enough elements 
(points) to ensure that arrows which act identically on all relevant elements 
are indeed identical. 

Theorem 16. Set is well-pointed. Grp, for example, is not. 

Proof. The claim about Set is immediate. 

In Grp, for example, a homomorphism from 1 (the one-element group) to a 
group X has to send the only element of 1 to the identity element e of X: so 
there is only one possible homomorphism e: 1 —> X (so the ‘elements’-as-arrows 
don’t in general line up one-to-one with elements of the group). 

Take any two group homomorphisms f,g: X — >• Y where / ^ g: yet for all 
possible e : 1 — ► X, both foe and goe send the sole element of 1 to the identity 
element of Y, so are equal. □ 

(b) We have just seen that, even when arrows in a category are functions, acting 
the same way on elements-as-arrows need not imply being the same arrow. But 
suppose we introduce this notion: 

Definition 23. A generalized element (of shape S) of the object X in Y is an 
arrow e: S — > X. < 
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(So a generalized element of X in ^ is in fact the same as an object of the 
slice category to /X.) Generalized elements give us, so to speak, more ways of 
probing objects. And we will later see that, while in Set an object - i.e. a set - 
is determined by its point-elements, in a general category we only have that 
an object is determined by its generalized elements. 

For the moment, though, we just note that: 

Theorem 17. Parallel arrows in a category to are identical if and only if they 
act identically on all generalized elements. 

Proof. If /, g : X Y act identically on all generalized elements, they act iden- 
tically on lx : X — ► X: therefore / o l x = g ol x , and so / = g. The converse is 
trivial. □ 
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Our next main topic will be a categorial treatment of products (as in Cartesian 
products). The paradigm construction in Set takes sets X and Y and forms 
the set of ordered pairs of their elements. But what are ordered pairs? What 
is involved in pairing schemes more generally? We’ll start by considering these 
sweeping questions as our route in to a categorial treatment of products. 


6.1 Real pairs, virtual pairs 

A word of caution first. We have fallen into the modern practice of using paren- 
theses in multiple ways (and we didn’t want to pause distractingly to discuss 
this before). But now we should draw an important distinction. 

We have, as is standard, used parentheses as in l (x,y)’ or ‘(j, k)' to refer 
to ordered pairs, and as in ‘(A, /,£?)’ to refer to an ordered triple. Here, the 
parentheses do essential work, expressing constructors sending given objects to 
real pairs (triples, etc.) which are objects distinct from their components. 

But we have also used parentheses in contexts where, at least initially, we 
can take them as simply helpful punctuation, not as a signifying a constructor 
for a new object. For example, when talking informally of the pre-ordered set 
‘(5, =$)’, we are initially just talking about the set S and about the ordering 
=<I defined over S, and we are not - or at least, not straight off - referring to 
some further pair-object. Here, we might say, the apparent reference to a pair is 
merely virtual, and can be translated away. Likewise, in talking of the monoid 
‘(M, •, 1 m)’ we are initially talking about a set, an operation on it and a selected 
element and not - or at least, not straight off about some further object. It 
may be that at some point we do need to regard pre-ordered sets or monoids 
as single objects, over and above the relevant sets and relations/functions, i.e. 
regard them as real rather than virtual pairs or triples. But that needn’t be 
understood as built into the notation from the very start. Our principle should 
always be: read such notation as noncommittally as possible. 

In this chapter, though, we are indeed concerned with real pairs. But what 
are they? 
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6.2 Pairing schemes 

(a) Suppose for a moment that we are working in a theory of arithmetic and 
want to start talking about ordered pairs of natural numbers - perhaps we 
want to go on to use such pairs in constructing integers or rationals. Well, we 
can handle such pairs without taking on any new commitments by using code- 
numbers. For example, if we want a bijective coding between pairs of naturals 
and all the numbers, we could adopt the scheme of coding the ordered pair 
(to, n) by the single number (to, n) = {(to + n) 2 + 3 to + n}/ 2. Or, if we don’t 
insist on every number coding a pair, we could adopt the simpler policy of using 
(to, n) =def 2 m 3 n . Relative to a given coding scheme, we can call such code- 
numbers (to, n) pair-numbers. Or, by a slight abuse of terminology, we can call 
them simply pairs, and we can refer to to as the first element of the pair, and n 
as the second element. 

Why should this way of handling ordered pairs of natural numbers be re- 
garded as somehow inferior to other, albeit more familiar, coding devices such 
as explicitly set-theoretic ones? 

It might be said that (i) a single pair-number is really neither ordered nor a 
twosome; (ii) while a number to is a member of (or is one of) the pair of to with 
n, a number can’t be a genuine member of a pair- number (to, n); and in any case 
(iii) such a coding scheme is pretty arbitrary (e.g. we could equally well have 
used 3 m 5” as a code for the pair to, n). 

Which is all true. But we can lay exactly analogous complaints against e.g. 
the familiar Kuratowski definition of ordered pairs that we all know and love. 
This treats the ordered pair of m with n as the set ( m,n)K = {{to}, {to, n}}. 
But (i) that set is not intrinsically ordered (after all, it is a set!), nor is it always 
two-membered (consider the case where to = n). (ii) Even when it is a twosome, 
its members are not the members of the pair: in standard set theories, to cannot 
be a member of {{to}, {?n,?i}}. And (iii) the construction again involves pretty 
arbitrary choices: thus {{77}, {to, n}} or {{{to}}, {{to, ti}}} etc., etc., would have 
done just as well. On these counts, at any rate, coding pairs of numbers by using 
pair-numbers involves no worse a trick than coding them using Kuratowski’s 
standard gadget. 

There is indeed a rather neat symmetry between the adoption of pair numbers 
as representing ordered pairs of numbers and another very familiar procedure 
adopted by the enthusiast for working in standard ZFC. For remember that pure 
ZFC knows only about pure sets. So to get natural numbers into the story at all 
- and hence to get Kuratowski pair-sets of natural numbers - the enthusiast for 
sets has to choose some convenient sequence of sets to implement the numbers (or 
to ‘stand proxy’ for numbers, ‘simulate’ them, ‘play the role’ of numbers, or even 
‘define’ them - whatever your favourite way of describing the situation is). But 
someone who, for her purposes, has opted to play the game this way, treats pure 
sets as basic and is dealing with natural numbers by selecting some convenient 
sets to implement them, is hardly in a position to complain about someone else 
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who, for his purposes, does the opposite and treats numbers as basic, and deals 
with ordered pairs of numbers by choosing some convenient code-numbers to 
implement them. Both theorists are in the implementation game. 

It might be retorted that the Kuratowski trick at least has the virtue of being 
an all-purpose device, available not just when you want to talk about pairs of 
numbers, while e.g. the powers-of-primes coding is of much more limited use. 
Again true. Similarly you can use sledgehammers to crack all sorts of things, 
while you can only use nutcrackers for nuts. But that’s not particularly to the 
point if it happens to be nuts you currently want to crack, efficiently and with 
minimum resources. If we want to implement pairs of numbers without onto- 
logical inflation - say in pursuing the project of ‘reverse mathematics’ (with 
its eventual aim of exposing the minimum commitments required for e.g. doing 
classical analysis) - then pair-numbers are just the kind of thing we need. 

(b) So what, more generally, does it take to have a way of pairing-up an object 
x £ X with an object y £Y7 

We need some objects O to serve as ordered pairs, a pairing function that 
sends a given x and y to a pair-object o £ O, and (of course!) a couple of 
functions which allow us to recover x and y from o. And the point we’ve just 
been making is that maybe we shouldn’t care too much about the ‘internal’ 
nature of the objects O, so long as we do have suitable associated pairing and 
impairing functions which work in the right way (for example, pairing and then 
impairing gets us back to where we started). Which motivates: 

Definition 24. Suppose X , Y and O are sets of objects (these can be the same 
or different). Let pr: X,Y — > O be a two-place function, while 7Ti : O — > X, and 
7t 2 : O — > Y, are one-place functions. Then [O, pr, 7Ti, 712 ] form a pairing scheme 
for X with Y iff 

(a) (Vx £ X)(\/y £ Y)('rr 1 (pr(x,y)) = x A n 2 {pr(x, y)) = y), 

(b) (Vo £ O ) pr(irio, 7 r 2 o) = o. 

The members of O will be said to be the pair-objects of the pairing scheme, with 
pr the associated pairing function, while 7Ti and 7r 2 are impairing or projection 
functions. <1 

Evidently, if O is the set of naturals of the form 2 m 3™ and pr(m,n) = 2 m 3”, 
with 7Tio (7 t 2 o) returning the exponent of 2 (3) in the factorization of o, then 
[O, pr, 7Ti, 7t 2 ] form a pairing scheme for N with N. And if O' is the set of Kura- 
towski pairs {(to, ti)k I fn, n £ N}, with pr'(m, n) = (m, u)k, and 7Ti (7t 2 ) taking 
a pair and returning its first (second) element, then [O ' , pr' , tt) , 7r 2 ] form another 
pairing scheme for N with N 

By the way, don’t over-interpret the square brackets: they do need here to be 
read as no more than punctuation. After all, we are in the business of character- 
izing pairs-as-single-objects; so we don’t want to presuppose e.g. that we already 
know about quadruples-as-single-objects! 
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Two simple facts about pairing schemes: 

i. Different pairs of objects are sent by pr to different pair-objects. For sup- 
pose pr(x,y) = pr(x',y'). Then by (a) x = ni(pr(x, y)) = wi (pr(x',y')) = 
x’ , and likewise y — y’ . 

ii. Note that by (b), pr is surjective. The ‘impairing’ or ‘projection’ functions 
7r i and 7T2 are also surjective. For given x € X, take any y G Y and put 
o = pr(x, y). Then by (a), x = 7TiO. Likewise, given y GY there is an o GO 
such that y = 'K‘ 2 . 0 . 

So, in sum, pairing schemes basically work as you would expect. 

As we’d also expect, a given pairing function fixes the two corresponding 
projection functions, and vice versa, in the following sense: 

Theorem 18. (1) If [0,pr, 7Ti,7T2] and [O, pr, 7rj, 7^] are both pairing schemes 
for X with Y, then 7Ti = 7rj and n 2 = tt' 2 . 

(2) If [0,pr, 7 Ti,7 T2] and [0,pr' ,tti, 712 ] are both pairing schemes, then pr = 
pr' . 

Proof. For (1), take any o G O. There is some (unique) x, y such that o = 
pr(x,y). Hence, applying (a) to both schemes, 7Tio = x = 7 rjo. Hence 7Ti = 7rj, 
and similarly 7T2 = 7r 2 . 

For (2), take any x G X, y G Y, and let pr(x,y) = o, so 7Tio = x and 
tt 2 o = y. Then by (b) applied to the second scheme, pr' (it 10 , 1 : 20 ) = o. Whence 
pr'(x,y) = pr(x,y). □ 

Further, there is a sense in which all schemes for pairing X with Y are equiv- 
alent up to isomorphism. More carefully, 

Theorem 19. If[0, pr,7Ti,7r 2 ] and [0',pr',it' 1 ,n , 2 \ are both schemes for pairing X 
with Y, then there is a unique bijection f : O — > O' such that for all x G X,y G Y , 
pr'(x,y) = f(pr(x, y)). 

Putting it another way, there is a unique bijection / such that, if we pair x with 
y using pr (in the first scheme), use f to send the resulting pair-object o to o', 
and then retrieve elements using 7r( and 712 (from the second scheme), we get 
back to the original x and y. 

Proof. Define / : O — > O' by putting /(o) = pr'(nio, tt- 2 o). Then it is immediate 
that f(pr(x, y)) =pr'(x,y). 

To show that / is injective, suppose /(o) = /(o'), for o, o' G O. Then we have 
pr'iyKio, 7t 2 o) = pr' ( ttio' ,tt 2 o'). Apply 7 t^ to each side and then use principle (a), 
and it follows that 7TiO = 7Tic/. And likewise n 20 = tt 2 o' . Therefore pr(irio, ^ 20 ) = 
pr(iTio', 7T2o'). Whence by condition (b), o = o'. 

To show that / is surjective, take any o' G O'. Then put o = pr^o' ,n' 2 o'). 
By the definition of /, /(o) = pr'(nio, tt 2 o)’, plugging the definition of o twice 
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into the right hand side and simplifying using rules (a) and (b) confirms that 
/(o) = o'. 

So / is a bijection with the right properties. And since every o G O is pr{x, y) 
for some x,y, the requirement that f(pr(x,y)) = pr'(x,y) fixes f uniquely. □ 

(c) Here’s another simple theorem, to motivate the final definition in this sec- 
tion: 

Theorem 20. Suppose X, Y, O are sets of objects, and the functions ni : O — > X, 
7T2 : O — ^ Y are such that there is a unique function pr:X,Y — >■ O such that 

(a) (V x G X)(\/y G Y)(TTi(pr(x,y)) = x A ir 2 (pr(x,y)) = y). Then [O, pr, m, 7t 2 ] 
form a pairing scheme. 

Proof. We argue that the uniqueness of pr ensures that the pairing function is 
surjective, and then that its surjectivity implies that condition (b) from Defn. 24 
holds as well as the given condition (a). 

Suppose pr is not surjective. Then for some o G O, there is no x G X,y G 
Y such that pr(x,y) = o. So pr(-7Tio, 7r 2 o) = o' ^ o. Consider then function 
pr' which agrees with pr on all inputs except that pr' (7Tio, 7r 2 o) = o. Then 
for all cases other than x = wi o,y = 7 r 2 o we still have iri(pr'(x, y)) = x A 
7t 2 (pr' (x, y)) = y, and by construction for the remaining case Wi (pr'^io, 7t 2 o)) = 
7 Tio A Tr 2 (pr' ( ttiO, 7t 2 o)) = 7r 2 o. So condition (a) holds for pr' , where pr' ^ pr. 
Contraposing, if pr uniquely satisfies the condition, it is surjective. 

Because pr is surjective, every o G O is pr{x , y) for some x, y. But then by (a) 
7Tio = x A 7t 2 o = y , and hence pr(irio, i r 2 o) = pr(x, y) = o. Generalizing gives us 

(b) . □ 

Pairing up X with Y through a pairing scheme, then, gives us a product-object 
O. But we don’t want to identify the resulting product simply with the set O 
(for it depends crucially on the rest of the pairing scheme what role O plays). 
Our last theorem, however, makes the following an appropriate definition: 

Definition 25. If X, Y are sets, then [0, 7^, 7t 2 ] form a product of X with Y, where 
O is a set, and tti : X — ► O, n 2 : Y —tO are functions, so long as there is a unique 
two-place function pr: X,Y — > O such that (Vx G X)(\/y G Y)(m(pr(x,y)) = 
x A ir 2 (pr(x,y)) = y). <1 

6.3 Binary products, categorially 

We have characterized pairing schemes and the resulting products they create 
in terms of a set of objects O being the source and target of some appropriate 
morphisms satisfying the principles in Defns. 24 and 25. Which all looks highly 
categorial in spirit (see the preamble to the previous chapter). 
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But our very natural story is not categorial quite as it stands. For a quite 
crucial ingredient, namely the pairing function pr : X, Y — > O, is a binary func- 
tion (taking as input two objects, not a single pair-object). But the arrows in a 
category are always unary, i.e. have just a single domain. So how can we get a 
properly categorial version of our story about pairing schemes? 

To work up to an answer, suppose for a moment we are working in a well- 
pointed category like Set, where ‘elements’ in the sense of Defn. 21 do behave 
sufficiently like how elements intuitively should behave. In this case, instead of 
talking informally of elements x of X and y of Y, we can talk of two arrows 
x: 1 — > X and y: 1 — > Y. Suppose then that there is an object O and two 
arrows, ni : O — > X and 7T2 : O — > Y such that for every x and y there is a 
unique arrow u: 1 — > O such the following commutes: 


1 



Our arrow u picks out an element in O to serve as the product-object pr(x,y). 
And the requirement that, uniquely, 7Ti o u = x A 7T2 o u = y is an instance of 
the general condition in Defn. 25, re-written in terms of elements-as-arrows. So 
we can naturally say that [O, 7Ti, 7T2] here form a product of X with Y . 

So far so good. But this will only give us what we want in well-pointed cat- 
egories with ‘enough’ elements-as-arrows. However, we know how to generalize 
to other categories: replace talk about elements (points) with talk of generalized 
elements. Which motivates the following definition: 

Definition 26. In any category a (binary) product [ 0 , 7Ti, 7^] for the objects X 
with Y is an object O together with ‘projection’ arrows 7 iq : O — >■ X, n 2 : O — > Y, 
such that for any object S and arrows /j : S — > X and / 2 : S — > Y there is 
always a unique ‘mediating’ arrow u: S — > O such that the following diagram 
commutes: 


S 




This unique mediating arrow u can more helpfully be labelled ‘(/i, fa}’- <1 

Note, by the way, that we are falling into the following common convention: 
in a category diagram, we use a dashed arrow — -> to indicate an arrow which is 
uniquely fixed by the requirement that the diagram commutes. 

Let’s have some examples. 
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(1) In Set, as you would certainly hope, the usual Cartesian product, the set 
X x Y of Kuratowksi pairs (x,y) of elements from X and Y, together 
with the obvious projection functions (. x , y) 4- 4 x and (x, y) y form a 
binary product. 

Let’s just confirm this. Suppose we are given any set S and functions 
/i : S — > X and / 2 : S — > Y. Then if, for s € S, we put u(s) = (/i(s), / 2 (s)), 
the diagram evidently commutes. Now trivially, for any pair p € X x Y , 
p = (7 Tip, 7 r 2 p) . Hence if v ! : S — > X x Y is another candidate for completing 
the diagram, u(s) = (fi(s), f-2(s)) = (ttiu'(s), n2u'(s)) = u'(s). So u is 
unique. 

Motivated by this paradigm case, we will henceforth often use the notation XxY 
for the object O in a binary product [0,77 i, 77 2 ] for X with Y . 

Continuing our examples: 

(2) In Grp, you can construct a product of the groups (G, •) and (77,©) as 
follows. Take the product object to be (G x H, x), i.e. the usual Cartesian 
product of the underlying sets, and let the group operation be defined 
component- wise, so that ( g , h) x (g' , h') = (g ■ g’,h © h'). Now equip this 
group with the obvious projection functions from G x H to G (resp. to H) 
which send (5, h) to g (resp. to h). 

(3) Take a poset ( P , =<!) considered as a category (so there is an arrow p — > q 
iff p =4 q)- Then a product of p and q would be an object c such that 
c =<! p, c =4 q and such that for any object cL with arrows from it to p and 
<7, i.e. any d such that d =4 P,d 4 q, there is a unique arrow from d to c, 
i.e. d =4 c. That means the product of p and q must be their greatest lower 
bound (equipped with the obvious two arrows). 

Since pairs of objects in posets need not in general have greatest lower 
bounds, that goes to show that a category in general need not have prod- 
ucts. 

(4) Here’s a new example of a category, call it Prop L its objects are proposi- 
tions, wffs of a given first-order language Jz? , and there is a unique arrow 
from X to Y iff X 1= Y, i.e. iff X semantically entails Y . The reflexiv- 
ity and transitivity of semantic entailment means we get the identity and 
composition laws which ensure that this is a category. 

In this case, one product of X with Y will be the conjunction X AY 
(with the obvious projections X AY — > X, X AY -aY). 

6.4 Products as terminal objects 

Here’s a slightly different way of putting things. Let’s say 

Definition 27. A wedge to X and Y (in category tf) is an object S and a pair 
of arrows fi : S — > X, / 2 : S — > Y. 
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h 


X 


A wedge q 


is a product for X with Y iff, for any other wedge g 




Y 


to X and Y, there exists a unique morphism u such that the following diagram 
commutes: 



We can say fi ‘factors’ as 7Ti o u and / 2 as 7 r 2 o u, and hence the whole wedge 
from S into X and Y (uniquely) factors through the product via the mediating 
arrow u. 

The definition of a product, now using the notion of wedges, can be recaptured 
as follows. First, we say: 


Definition 28. Given a category Y and objects ^-objects X, Y, then the derived 
wedge category Y’w(xy) has as the following data. Its objects are all the wedges 
[0,/i,/ 2 ] to X, Y. 1 And an arrow from [0,/i,/ 2 ] to [O' , f!f\ is a ^-arrow 
g: O — > O' such that the two resulting triangles commute: i.e. /i = f[ o g, 
/ 2 = f 2 o g. The identity arrow on [O, /i,/ 2 ] is 1 o, and the composition of 
arrows in Yw(xy) is the same as their composition as arrows of Y? . < 

It is easily confirmed that ^w(XY) is indeed a category. And it is trivial to show, 
for future reference, that 


Theorem 21. If g: [0,/i,/ 2 ] — > [0 / ,/i,/ 2 ] is an isomorphism in Y?w(xy)> then 
g: O — >■ O' is an isomorphism in Y? . 

With our new definition of the derived category ^w(xy) to hand, our previous 
definition of a product can be recast like this: 


Definition 29. A product of X with Y in Y> is a terminal object of the derived 
category Y w(X y)- < 


^^Does regarding [0,/i,/2] as an object-in-a-category automatically mean treating it as 
single object in the logician’s sense, a real triple, meaning something over an above its com- 
ponents? Why so? If the object-data in a category can - from a logical point of view - be as 
diverse as object, relations, functions, arrows from other categories, etc., it isn’t obvious why 
they shouldn’t comprise an object equipped with two related arrows. 
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6.5 Uniqueness up to unique isomorphism 

As noted, products need not exist for arbitrary objects X and Y in a given 
category and when they exist, they need not be strictly unique. However, 
when they do exist, they are ‘unique up to unique isomorphism’. That is to say, 

Theorem 22. If both [O, 7Ti,7T2] and [O', 7^,7^] are products for X withY in the 
category to , then there is a unique isomorphism f: O O' commuting with 
the projection arrows (i.e. such that o / = ir\ and tt ' 2 o / = m). 

Note the statement of the theorem carefully. It is not being baldly claimed that 
there is a unique isomorphism between any objects O and O' which are parts of 
different products for some given X, Y. That’s false. For a very simple example, 
in Set, take the standard product object X x X comprising Kuratowski pairs: 
there are evidently two isomorphisms between it and itself, given by the maps 
( x,x ') i ^ (x,x'), and (x,x') H > (x',x). The claim is, to repeat, that there is a 
unique isomorphism between any two product objects O and O' which respects 
their associated projection arrows. 

Plodding proof from basic principles. Since [0, 7Ti, 7^] is a product, every wedge 
factors uniquely through it, including itself. In other words, there is a unique u 
such that this diagram commutes: 


O 



But evidently putting 1 o for the central arrow trivially makes the diagram com- 
mute. So by the uniqueness requirement we know that 

(i) Given an arrow u : O — > O , if tt \ o u = ~ K \ and 772 o u = 772, then u = lo - 

Now, since [O', 7^, tt' 2 ] is a product, [0, 7Ti, 7 t 2 ] has to uniquely factor through it: 


O 



In other words, there is a unique / : O O' commuting with the projection 
arrows, i.e. such that 

(ii) tt[ o f = m and 7r^ o / = 7r 2 . 

And since [0,7Ti,7r 2 ] is also a product, the other wedge has to uniquely factor 
through it. That is to say, there is a unique g: O’ O such that 
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(iii) 7 Ti o g = 7 r^ and 7^0 g = ir' 2 - 
Whence, 

(iv) 7Ti o g O f = 7T^ o / = 7Ti and 7T 2 o g O f = 7T2- 

From which it follows - given our initial observation (i) that 

(v) go f = l 0 

The situation with the wedges is symmetric so we also have 

(vi) / o g = 1 Q , 

Hence / is an isomorphism. □ 

However, you’ll recognize the key proof idea here is akin to the one we used 
in proving that initial/terminal objects are unique up to unique isomorphism. 
And we indeed can just appeal to that earlier result: 

Proof using the alternative definition of products. [0, 7Ti, 7r 2 ] and [O', 7 ^, 7 ^] are 
both terminal objects in the wedge category ^w(xy)- So by Theorem 14 there 
is a unique ffwtXY ) -isomorphism / between them. But, by definition, this has 
to be a -arrow f:0 — » O' commuting with the projection arrows. And by 
Theorem 21, / is an isomorphism in . □ 

6.6 ‘Universal mapping properties’ 

Let’s pause for a moment. We have defined a binary product for X with Y 
categorially as a special sort of wedge to X and Y. 

Now, that doesn’t fix a product absolutely; but we have now seen that prod- 
ucts will be ‘unique up to unique isomorphism’. And what makes a wedge a 
product for X with Y is that it has a certain universal property - i.e. any other 
wedge to X and Y factors uniquely through a product wedge via a unique map. 

We can say, then, that products are defined by a universal mapping property. 
We’ve already met other examples of universal mapping properties: terminal and 
initial objects are defined by how any other object has a unique map to or from 
them. We will meet lots more examples. 

It is perhaps too soon, however, to attempt a formal definition of what it is to 
be defined by a universal mapping property. So for the moment take the notion 
as an informal gesture towards a common pattern of definition which we can 
learn to recognize when we come across it. 
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We continue to explore binary products, and then explain how to use them to 
categorially implement two-place functions. We show this implementation being 
put to work before going on to discuss products of more than two objects. The 
chapter finishes by considering the duals of products, namely coproducts. 

7.1 More properties of binary products 

(a) We start by verifying some more respects in which binary products behave 
just as you would hope: 

Theorem 23. In a category which has a terminal object 1 and has the relevant 
products, 

(1) lxX=X=Xxl 

(2) X xY = Y x X 

(3) X x (Y x Z) = (X x Y) x Z. 

Proof. (1) Evidently the wedge X — X — ■ — > 1 exists for some unique 
arrow ! since 1 is terminal. Hence if X x 1 is a product, there is a unique u such 
that this diagram commutes: 


x X X 1 


Hence, in particular, tt\ o u = lx- 

Now consider the arrow u o m: X x 1 X x 1 . We have 

(a) 7T i O (u O 7Tl) = (tT 1 o u) O 7Ti = 7Tl. 

(b) 7T2 o (u o 7Ti) = 7T2 (both sides being arrows from X x 1 to the terminal 
object 1, hence necessarily equal). 
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We now appeal to the principle (i) at the beginning of the plodding proof of 
Theorem 22 to conclude u o m = Ia'xi- So m has a two-sided inverse, i.e. is an 
isomorphism, whence 1 = 1x1. 

Similarly, of course, for the other half of (1). 

(2) If [X x Y, 7Ti : X x Y — > X, 7r 2 : X x Y — > Y] is a product of X with Y, 
then [X x Y, 7T2, 7 Ti] is obviously a product of Y with X. Hence - by Theorem 22 
- there is an isomorphism between the object in that product and the object 
Y x X oi any other product of Y with X. 

(3) It is a just-about-useful reality check to prove this by appeal to our initial 

definition of a product, using brute force. You are invited to try! But we give a 
slicker proof in §7.5. □ 

(b) We next gather together, for later use, a natural definition and a handful of 
further simple results. First, it is worth noting that our product-style notation 
for mediating arrows to products doesn’t mislead, because we have: 

Theorem 24. If </i, / 2 ) = {91,92), then fi = 31 and f 2 = 92- 

Proof. Being equal, (/1, / 2 ) and (31,32) must share as target the object in some 
product [1x7, 7Ti, 7 t 2 ]. We therefore have fi = TTio{f 1 , / 2 ) = 77*0 (31,32) = 3;. □ 

Here’s a sort-of-converse: 

Theorem 25. Given a product [Xx Y, 7Ti,7t 2 ] and arrows S 1 1x7, then, 
if 7Ti o u = 7Ti o v and 7r 2 o u = 7t 2 o v, it follows that u = v. 

Proof. We have in fact already seen this result for the special case where v is the 
identity arrow. Another diagram shows all we need to prove the general case: 


S 



The same wedge X S — > Y factors through 1x7 both via u and v hence, 
by uniqueness of mediating arrows, u = v. □ 

Definition 30. Suppose we are working in a category with the relevant prod- 
ucts. Then the wedge X < 1a X lx > X must factor uniquely through the 
product X x X via an arrow Sx ■ X — > X x X . That unique arrow 5 , \ is the 
diagonal morphism on A. <1 

In Set, thinking of X x X in the usual way, 6 x sends an element x £ X to { x , x) 
(imagine elements (x, x) lying down the diagonal of a two-dimensional array of 
pairs (x,y): hence the label ‘diagonal’ and the notation). 
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Theorem 26. Given an arrow q : S — ► X , 5 x ° q = (q, q}- 


Proof. Consider the following diagram: 



The inner triangles commute, hence Sx ° q is a mediating arrow factoring the 
wedge X < q — S — q > X through the product X x X. But by definition, 
the unique mediating arrow which does that is (q,q). □ 

Theorem 27. Assuming the ( f,g ) and e compose, (f,g) oe = (f oe, go e). 


Proof. Another, rather similar, diagram gives the proof: 


R 



Again the inner triangles commute, hence (f,g)oe is a mediating arrow factoring 
the wedge with apex R through the product X x Y. But by definition, the unique 
mediating arrow is (/oe,joe). □ 


Theorem 28. Given parallel arrows S 
least) four distinct arrows S — > X x X. 



X , with fi yf / 2 , there are (at 


Proof. By definition of the product, for each pair of indices i,j € {1, 2} there is 
a unique map (fi, ff) which makes the product diagram commute, 

S 



It is immediate from Theorem 24 that if (fi, ff) = (/*,, ff), then i = k,j = l. So 
each of the four different pairs of indices tally different arrows (fi, ff). □ 
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(c) Finally in this section, again for future use, we should remark on a non- 
theorem. Suppose we have a pair of parallel composite arrows built up using 

/ 

the same projection arrow like this: X x Y — x \ X' . In Set, the 

projection arrow here just ‘throws away’ the second component of pairs living 
in X x Y, and all the real action happens on X, so if / o m = g o 7Ti, we should 
also have / = g. Generalizing, we might then suppose that, in any category, 
projection arrows in products are always right-cancellable, i.e. are epic. 

This is wrong. Here’s a brute-force counterexample. Consider the mini cat- 
egory with just four objects together with the following diagrammed arrows 
(labelled suggestively but noncommittally), plus all identity arrows, and the 
necessary two composites: 

X' j f X ^ V ” 2 > Y 

9 

If that is all the data we have to go on, we can consistently stipulate that in 
this mini-category f ^ g but f o m = g o tt\. Now, there is only one wedge of 

the form X i ? » Z , so trivially all wedges of that shape uniquely 

factor through it. In other words, the wedge X 1 — V — — Y is trivially 
a product and 7Ti is indeed a projection arrow. But by construction it isn’t epic. 


7.2 Maps between two products 


(a) Suppose we have two arrows /: X X', g: Y — > Y'. Then we might want 
to characterize an arrow / x g: X xY — > X' xY' which works component-wise 
- i.e., putting it informally, the idea is that / x g sends the product of elements 
x and y to the product of f(x) and g{y). 

In more categorial terms, we require /x g to be such that the following diagram 
commutes: 


X ^ Xxf ^ > Y 


fxg 


X’ X’ x Y' 


4 Y’ 


Note, however, that the vertical arrow is then a mediating arrow from the wedge 

X> j7 07Tl x x Y gQ7T2 > Y' through the product X’ x Y' . Therefore / x g is 
indeed fixed uniquely by the requirement that that diagram commutes, and 
equals (/ o 7Ti, g o 7r2). This warrants the following definition as in good order: 

Definition 31. Given the arrows /: X — » X' , g: Y — >• Y' , and the products 
[X x Y, 7 r 1 , 7 r 2 ] and \X' x Y' , 7 Tj_ , 7 r r 2 ], then / x g: X xY — > X' x Y' is the unique 
arrow such that o / x g = f o 77 1 and tt ' 2 o / x g = g o tt2- <1 
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(b) Here’s a special case: sometimes we have an arrow / : X — x X' and we want 
to define an arrow from Ixf to X' x Y which applies / to the first component 
of a product and leaves the second alone. Then / x ly will do the trick. 

It is tempting to suppose that if we have parallel maps /, g: X — x X' and 
/ x ly = j x ly, then / = g. But this actually fails in some categories - 
for example, in the toy category we met in §7.1 (c), whose only arrows are as 
diagrammed 

X' ( f X ^ V " 1 2 > Y 

9 

together with the necessary identities and composites, and where by stipulation 
/ =/= g but / o 7Ti = g o 7Ti (and hence / x ly = g x ly). 

(c) Later, we will also need the following (rather predictable) general result: 
Theorem 29. Assume that there are arrows 

X x X' x X" 

Y x Y' x Y" 

Assume there are products [X x Y, 7ri,7r 2 ], [X' x Y' , 7r^, Tt' 2 ] and [X" x Y", n”, 
Then (j x k) o (/ x g) = (j o /) x (k o g). 

Proof. By the defining property of arrow products applied to the three different 
products we get, 

tt" o ( j x k) o (/ x g) = j o Tiy o (/ x g) = j o / o tti = 7r" o (j o /) x (fc o gr). 
Similarly 

71-2 ° (j X k) O (/ x g) = 7T2 o (j o /) x (k O g) 

The theorem then immediately follows by Theorem 25. □ 

7.3 Two-place functions 

Let’s now return to the issue of two-place functions which we sidestepped in 
§6.3, and consider how such functions get implemented in category theory. 

It might be helpful to recall first how a couple of other familiar frameworks 
manage to do without genuine multi-place functions by providing workable sub- 
stitutes: 

(1) Set-theoretic orthodoxy models a two-place total function from numbers to 

numbers (addition, say) as a function / : N 2 — X N. Here, N 2 is the cartesian 
product of N with itself, i.e. is the set of ordered pairs of numbers. And 
an ordered pair is one thing not two things. So, in set-theory, a function 
/ : N 2 — X N is in fact strictly speaking a unary function , a function that 
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maps one argument, an ordered pair object, to a value, not a real binary 
function. 

Of course, in set-theory, for any two things there is a pair-object that 
codes for them - we usually choose a Kuratowski pair - and so we can 
indeed trade in a function from two objects for a related function from 
the corresponding pair-object. And standard notational choices can indeed 
make the trade quite invisible. Suppose we adopt, as we earlier did, the 
modern convention of using ‘(to, n)’ as our notation for the ordered pair of 
to with n. Then ‘/(m,n)’ invites being parsed either way, as representing 
a two-place function /(• , •) with arguments m and n or as a corresponding 
one-place function /• with the single argument, the pair ( m,n ). But note: 
the fact that the trade between the two-place and the one-place function 
is notationally glossed over doesn’t mean that it isn’t being made. 

(2) Versions of type theory deal with two-place functions in a different way, 
by a type-shifting trick. Addition for example - naively a binary function 
that just deals in numbers - is traded in for a function of the type N — > 
(N — » N). This is a unary function which takes one number (of type N) to 
outputs something of a higher type, i.e. a unary function (of type N — > N). 
We then get from two numbers as input to a numerical output in two steps, 
by feeding the first number to a function which delivers another function 
as output and then feeding the second number to the second function. 

This so-called ‘currying’ trick of course is also perfectly adequate for 
certain formal purposes. But again a trade is being made. Here’s a revealing 
quote from A Gentle Introduction to Haskell on the haskell.org site (Haskell 
being one those programming languages where what we might think of 
naturally as binary functions are curried): 

Consider this definition of a function which adds its two argu- 
ments: 

add :: Integer— ► Integer — ► Integer 
add x y = x + y 

So we have the declaration of type - we are told that add sends a number 
to a function from numbers to numbers. We are then told how this curried 
function acts ... but how? By appeal, of course, to our prior understand- 
ing of the familiar school-room two-place addition function! The binary 
function remains an essential rung on the ladder by which we climb to an 
understanding of what’s going on in the likes of Haskell (even if we propose 
to throw away the ladder after we’ve climbed it). 

Now back to categories. We don’t have native binary morphisms in category 
theory. Nor do we get straightforward currying within a category, at least in the 
sense that we won’t have arrows inside a category whose target is another arrow 
of that category (though we will meet a version of currying when we get round 
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to talking about exponentials in categories). So what about using a version of 
the set-theoretic trick? 

This, as we’ve seen, is the line to follow. We can in a noncircular way give a 
categorial treatment of pair-objects as ingredients of products. And with such 
objects now to hand, an arrow of the kind f:XxY — > S is indeed available 
do duty for a two-place function from an object in X and an object in Y to a 
value in S. So this will be our implementation device. Let’s pause to illustrate 
it being put to work together with our account of maps between two products. 


7.4 Example: groups in categories 


(a) We have already seen that there is a category Grp comprising all groups, 
and that any individual group can also be considered as a category in its own 
right. In this section we return to talk about groups again, and show how to 
characterize groups categorically. 

We informally think of a group as some objects G equipped with a binary 
operation m - group multiplication - and with a designated element e which 
is an identity for the group operation. The group operation is associative, and 
every element has a two-sided inverse. 

Now, how can we characterise such a structure living in Set? We need a set 
G of elements, and three arrows (which are functions in this category): 

(i) m: G x G — > G (so here we are, as announced, trading in the informal 
binary function for an arrow from a binary product), 

(ii) e: 1 — x G (we’ve met before the trick of using this sort of arrow as a way 
of giving a designated element e), 

(iii) i: G — > G (this is going to be the arrow that sends an element to its 
inverse) . 

(We’ll let context, informal vs. categorial, disambiguate the notation!) And to 
get a group, we need to impose constraints on these arrows corresponding to the 
usual group axioms: 

(1) Informally, we require the group operation m to be associative. Categori- 
ally, consider the following diagram: 


(Gl) 


(G x G) x G 


mx 1 g 

GxG — 


-)Gx(GxG) 


->Gf 


lcXm 

GxG 


Here the arrow at the top represents the naturally arising isomorphism 
between the two triple products that is (or rather, will be) established by 
the proof of Theorem 23 (3). 
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Remembering that we are working in Set, an element (( j,k),l } e (Gx 
G) x G. Going round on the left, that gets sent to (■ m(j , k),l) and then to 
m(m(j , k) , l) . Going round the other direction we get to So 

requiring the diagram to commute captures the associativity of m. 

(2) Informally, we next require e to act as a multiplicative identity: i.e. , for all 
g GG, m(g, e) = g = m(e, g). 

To get a categorial equivalent to that condition, start by defining the 
map e! : G — X G by composing G -4 s 1 G. In Set we can think of e! 
as the function which sends anything in G to the identity element of the 
group. We then have the following product diagram: 


G 



and we can think of the mediating arrow ( 1 < 3 , e!) as sending an element 
g G G to the pair (g,e). The element e then behaves like a multiplicative 
identity on the right if m sends this pair (g, e) back to g i.e. if the top 
triangle in the following diagram commutes: 



Similarly the lower triangle commutes just if e behaves as an identity on 
the left. So, for e to behave as an identity element, it is enough that the 
whole diagram commutes. 

(3) Finally, we informally require that every element g G G has an inverse g^ 1 
or i(g) such that m(g,i{g)) = e = m(i(g),g). Categorially, we can express 
this by requiring that the following commutes: 


(G3) 


GxG <- 


G ^ > GxG 


GxG 


> G < 


GxG 


For take an element g G G. Going left, the diagonal arrow Sq (from 
Defn. 30) maps it to the pair (g,g), which is mapped in turn by 1 g x i to 
(g,i(g)) and then by m to m(g,i(g)). Therefore, the requirement that the 
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left square commutes tells us, as we want, that m(g,i(g)) = e. Similarly 
the requirement that the right square commutes tells us that m(i(g),g) = e 

In summary then, the informal group axioms correspond to the commutativity 
of our last three diagrams. 

But note immediately that this categorial treatment of groups only requires 
that we are working in a category with binary products and a terminal object. 
So it is natural to generalize, as follows: 

Definition 32. Suppose is a category which has the relevant binary products 
and a terminal object. Let G be a ^-object, and m: G x G — > G, e: 1 — > G 
and i : G — > G be ^-arrows. Then [G,m,e,i] is a group-object in iff the three 
diagrams (Gl), (G2), (G3) commute, where e! in the latter two diagrams is the 

composite map G -4 1. 4% G. 

Here, ‘group object’ (rather than plain ‘group’) is the standard terminology: 
some alternatively say ‘internal group’. 

Then, if we don’t fuss about the difference between an arrow e : 1 — > G (in a 
group object) and a designated element e (in a group), we have established the 
summary result 

Theorem 30. In the category Set, a group object is a group. 

And conversely, every group or to be pernickety, every group which hasn’t got 
too many elements to form a set - can be regarded as a group object in Set. 

(b) Here are just a few more examples of group objects living in different 
categories: 

Theorem 31. (1) In the category Top, which comprises topological spaces with 

continuous maps between them, a group object is a topological group in the 
standard sense. 

(2) In the category Man. which comprises smooth manifolds with smooth maps 
between them, a group object is a Lie group. 

(3) In the category Grp, a group object is an abelian group. 

Proof. The proofs of (1) and (2) are predictably straightforward if you know the 
usual definitions of topological groups and Lie groups, and we won’t pause over 
them. 

Claim (3), however, is more unexpected. But the proof is relatively straight- 
forward, rather cute, and a rather useful reality-check. So let’s present it. 

Suppose [G,m,e,i] is a group-object in Grp. Then the object G is already a 
group of objects G equipped with a group operation and an identity element. 
We’ll use ordinary multiplication notation for the operation, as in l x ■ y\ and 
we’ll dub the identity ‘ 1 ’ (so the group G is notated with dots!). The arrow 


51 



Products explored 


e : 1 — > G in the group object also picks out a distinguished element of G, call it 
‘1’, an identity for m. 

Now, each arrow in the group-object [G,m,e,i\ lives in Grp, so is a group 
homomorphism. That means in particular m is a homomorphism from GxG (the 
product group, with group operation x ) to G. So take the elements x, y,z,w £ G. 
Then, 

m(x ■ z,y • w) = m((x, y) x (z, w)) = m(x, y) ■ m(z, w) 

The first equation holds because of how the operation x is defined for the product 
group; the second equation holds because m is a homomorphism. 

For vividness, let’s rewrite m(x,y) as x-ky (so 1 is the unit for *). Then we 
have established the interchange law 

(x ■ z) * (y ■ w) = (a: * y) ■ (z * w). 

We will now use this law twice over. First, we have 

i = i. i = (i*i) ■ (i*I) = (i • i)*(i- 1) = i*i = i 

We can therefore just write 1 for the shared unit, and show secondly that 

x ■ y = {x * 1) • (1 * y) = (x • 1) * (1 • y) = x * y 
= (1 ' x) * {y • 1) = (1 * y) ■ {x * 1) = y ■ x. 

We have shown, then, that if [G, to, e, *] is a group object in Grp, G’s own group 
operation commutes, and m is the same operation so that must also commute. 
Therefore the group object is indeed an abelian group. (We can, but won’t, also 
show that every abelian group can be regarded as a group object in Grp.) □ 

(c) We could continue the story. We could categorially define a group homo- 
morphism between two group objects in a category ‘if, and that would in turn 
enable us to define a category of groups living in . However, we won’t pursue 
this further in this chapter. We’ve done enough for our present purposes; we 
have given an initial illustration of how products in categories can be put to 
work in defining structures within categories. More on this and related themes 
in due course. But for now, it’s back to the more basic discussion of products. 
Two major themes remain - generalization and dualization. 

7.5 Products generalized 

(a) So far we have talked of binary products. But we can generalize in obvious 
ways. For example, 

Definition 33 . In any category a ternary product [0, 7Ti, 7r 2 , 773] for the objects 
Xi,X 2 ,X 3 is an object O together with projection arrows 7Tj: O — > X, (for 
i = 1,2,3) such that for any object S and arrows /, : S X t there is always a 
unique arrow arrow u: S — > O such that /,; = 7Tj o u. <1 
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And then, exactly as we would expect, using just the same proof ideas as in the 
binary case, we can prove 

Theorem 32. If both the ternary products [O, 7Ti, 7r2, 713] and [O', 71-j, 7^, 7Tg] exist 
for X \ , X2 , A3 in the category ho , then there is a unique isomorphism f : O 
O' commuting with the projection arrows. 

We now note that if ^ has binary products for all pairs of objects, then it has 
ternary products too, for 

Theorem 33. (Ad x X2) x A3 together with the obvious projection arrows forms 
a ternary product of X 1, Ah, A3. 

Proof. Assume [Ah x A 2 ,7 Ti,7t 2 ] is a product of Ah with Ah, and [(Ah x Ah) x 
A3, Pi > P2] is a product of Ah x Ah with A 3 . 

Take any object S and arrows /) : S — > Ah. By our first assumption, (a) there 
is a unique it: S — > Ah x A 2 such that / 1 = iri o u , / 2 = 7t 2 o it. So by our 
second assumption (b) there is then a unique v: S — >■ (Ah x X 2 ) x Ah such that 
u = Pi O V, f 3 = p 2 ° V. 

Therefore fi=niop 1 o v, / 2 = 7t 2 o pi o v , f 3 = p2°v 

So now consider [(Ai xl 2 ) x A3,7Tiop 1 ,7r 2 o/3 1 ,p 2 ]. This, we claim, is indeed a 
ternary product of Ah, Ah, Ah. We’ve just proved that S and arrows /) : S —> Ah 
factor through the product via the arrow v. It remains to confirm v’s uniqueness 
in this new role. 

Suppose we have w : S — > ( Ah x A 2 ) x A3 where /1 = niop^w, / 2 = ir 2 opiow, 
fi = P2 0 w. Then p\ o w: S — » Xi x A 2 is such that /1 = o ( p 1 o w), 
f 2 = 7 t 2 o [pi ow). Hence by (a), u = p\ ow. But now invoking (b), that together 
with f 3 = P2 ° w entails w = v. □ 

Evidently, an exactly similar argument will show that Ai x (A 2 x A3) together 
with the obvious projection arrows forms a ternary product of Ai, A 2 , A3. Hence 
we are now in a position to neatly prove 

Theorem 23. ( 3 ) A x (F x Z) ^ (A x Y) x Z. 

Proof. Both (Ai x A 2 ) x A3 and Ai x (A 2 x A3) (with their projection arrows) 
are ternary products of X 3 , A 2 , A 3 . So Theorem 32 entails that Ai x (A 2 x A 3 ) = 
(Ai x A 2 ) x A 3 . □ 

(b) What goes for ternary products goes for n-ary products defined in a way 
exactly analogous to Defn. 33 . If ^ has binary products for all pairs of objects it 
will have quaternary products such as ((Ai x A 2 ) x A 3 ) x A4, quinary products, 
and n-ary products more generally, for any finite n > 2. 

To round things out, how do things go for the nullary and unary cases? 
Following the same pattern of definition, a mdlary product in would be an 
object O together with no projection arrows, such that for any object S there 


53 



Products explored 


is a unique arrow u: S — ► O. Which is just to say that a nullary product is a 
terminal object of the category. 

And a unary product of X would be an object O and a single projection arrow 
7Ti : O — > X such that for any object S and arrow f:S—*X there is a unique 
arrow u: S — >• O such that ir o u = f. Putting O = X and tt = lx evidently 
fits the bill. So the basic case of a unary product of X is not quite X itself, but 
rather X equipped with its identity arrow (and like any product, this is unique 
up to unique isomorphism). Trivially, all unary products exist in all categories. 
In sum, suppose we say 

Definition 34. A category Y has all binary products iff for all "^-objects X and 
Y, there exists a binary product of X with Y in Y . 

Y has all finite products iff Y has n-ary products for any n objects, for all 
n > 0. <1 

Then our preceding remarks establish 

Theorem 34. A category Y has all finite products iff Y has a terminal object 
and has all binary products. 

(c) We can generalize further in the obvious way, beyond finite products to 
infinite cases. We will however be interested in cases where we are taking a 
product of objects which are not too many to form a set - i.e. we can take the 
objects to be indexed by the elements of some index set I. 

Definition 35. In any category Y, the product of the “^-objects X t (for indices 
i £ /), if it exists, is an object O together with projection arrows 7r,; : O — > Xi 
(for i £ I) such that for any object S and arrows /» : S — > X; (again for i £ I), 
there is always a unique arrow arrow u: S — » O such that /, = 7r, o u. (We can 
use ‘ J([ Xf to notate such anO.) <1 

is/ 

As before, these generalized products will be unique up to unique isomorphism. 
And we will say 

Definition 36. A category Y has all (small) products iff for any set’s worth of 
^-objects - i.e. any objects ^-objects X,, for i £ I, where I is some set - the 
objects in the set have a product. <1 

Here, ‘small’ doesn’t mean small by any normal standards! - it just indicates 
that we are taking products over collections of objects that are not so big that 
they can’t be indexed by a set living somewhere in the usual universe of sets. 
We’ll be returning to such issues of size in due course. 


7.6 Coproducts 

(a) Let’s note a common terminological device: 
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Definition 37. For very many kinds of categorially defined widget, a co-widget 
of the category C is a widget of C op : co-widgets are dual to widgets. <1 

For example, we have met co-slice categories, the duals of slice categories. We 
could (and a few do) call initial objects ‘co-terminal’. Likewise we could (and a 
few do) call sections ‘co-retractions’. True, there is a limit to this sort of thing - 
no one, as far as I know, talks e.g. of ‘co-monomorphisms’ (instead of ‘epimor- 
phisrns’). But still, the general convention is used very widely. In particular, it 
is absolutely standard to talk of the duals of products as ‘co-products’ - though 
in this case, the hyphen is usually dropped. 

(b) The definition of a coproduct is immediately obtained, then, by reversing 
all the arrows in our definition of products. Thus: 

Definition 38. In any category a (binary) coproduct [O, for the objects 
X with Y is an object O together with ‘injection’ arrows L\ : X — > O, (. 2 : Y — > O, 
such that for any object S and arrows /i : X — ► S and / 2 : Y — > S there is always 
a unique mediating arrow v: O — > S such that the following diagram commutes: 


S 



The object O in a coproduct for X with Y is often notated l X © Y’ or ‘X II Y’; 
and we can notate the mediating arrow v by ‘ [fi , / 2 ] . <1 

Note, however, that the ‘injections’ in this sense need not be injective or even 
rnonic. 

Let’s say that objects and arrows arranged as X — O — Y form a 
corner (or we could say ‘co-wedge’) from X and Y with vertex O. Then a co- 
product of X with Y can be thought of as a corner from X and Y which factors 
through any other corner from X and Y via a unique map between the vertices 
of the corners. 

We could now go on to define a category of corners from X and Y on the 
model of a category of wedges to X and Y, and then redefine a coproduct of X 
with Y as an initial object of this category. It is a useful reality check to work 
through the details. 

(c) Let’s have some examples of coproducts. Start with easy cases: 

(1) For Set, the headline news is that disjoint unions are coproducts. 

Given sets X and Y, let X® Y be the set with members (x, 0) for x £ X 
and {y, 1) for y £ Y. And let the injection arrow i\ : X — > X © Y be the 
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function x H ► (x, 0), and similarly let t 2 : Y — > X © Y be the function 
y i->- (y, 1). Then [ X ® Y, ti, r 2 ] is a coproduct for X with Y. 

To show this, take any object S and arrows f\ : X — > S and f 2 : Y — » S, 
and then define the function v : X ® Y — > S' as sending an element (x, 0) 
to fi(x) and an element (y, 1) to f 2 {y)- 

By construction, this will make both triangles commute in the diagram 
in the definition above. 

Moreover, if v' is another candidate for completing the diagram, then 
u'((x,0)) = v'ol i(x) = /i(x) = v((x, 0}), and likewise v'((y, 1)) = v({y, 1)), 
whence v' = v, which gives us the necessary uniqueness. 

(2) Take a poset ( P , =<!) considered as a category (so there is an arrow p — > q 
iff p =4 q)- Then a coproduct of p and q would be an object c such that 
p =4 c, q =4 c and such that for any object d such that p =4 d, q =4 d there is 
a unique arrow from c to d 1 i.e. c =4 d. Which means that the coproduct of 
p and q, if it exists, must be their least upper bound (equipped with the 
obvious two arrows). 

(3) In Prop L (which we met in §6.3) the disjunction X\/ Y (with the obvious 
injections X — jXVb, Y — > X V F) is a coproduct of X with Y. 

However things soon get rather less obvious. We’ll mention a couple more cases. 

But the details here aren’t going to matter, so by all means skip: 

(4) In the category Grp, coproducts are the so-called ‘free products’ of groups. 

Take the groups G = (G, •), H = ( H , 0) (we abuse notation in a familiar 
way, recycling the label for a group’s carrier set as the label for the group, 
and letting context disambiguate). If necessary, now doctor the groups 
to equate their identity elements while ensuring the sets G and H are 
otherwise disjoint. Form all the finite ‘reduced words’ G * H you get by 
concatenating elements from GUH , and then multiplying out neighbouring 
G-elements by • and neighbouring iJ-elements by © as far as you can. 
Equip G * H with the operation ❖ of concatenation-of-words-followed-by- 
reduction. Then G*H = (G*H, o) is a group - the free product of the two 
groups G and H - and there are obvious ‘injection’ group homomorphisnrs 
i \ : G^-G-kH, l 2 : H ~^G*H. 

Claim: [G * H,i\, i 2 \ is a coproduct for the groups G and H. That is to 
say, for any group K = (K, *) and morphisms f \ : G — > K. f 2 : H -* K, 
there is a unique v such that this commutes: 


I< 
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Put v : G-kH — > K to be the morphism that sends a word gih\g 2 h 2 ■ ■ ■ g r 
(. 9i e G, hi G 77) to j(gi)*k(hi)*j(g 2 )*k(h 2 )*---*j(g r )- By construction, 
vo n=j, vo i 2 = k. So that makes the diagram commute. 

Let v' be any other candidate group homomorphism to make the diagram 
commute. Then, to take a simple example, consider gh G G * 77. Then 
v'{gh) = v'(g) * v'{h) = v'(h (g)) * v'{i 2 {h)) = f x (g) * f 2 (g) = v(h(g)) * 
v(ii(g)) = v(ii(g) * ii(g)) = v(gh). And by induction over the length of 
words we’ll get v' = v. So, as required, v is unique. 

(5) So what about coproducts in Ab, the category of abelian groups? Since 
the free product of two abelian groups need not be abelian, the same 
construction won’t work again as it stands. 

OK: hit the construction with the extra requirement that words in GkH 
be treated as the same if one can be shuffled into the other (in effect, further 
reduce G k 77 by quotienting out with the obvious equivalence relation). 
But that means that we can take a word other than the identity, bring 
all the G-elements to the front, followed by all the 77 elements: but now 
multiply out the G-elements and the 77-elements and we are left with two- 
element word gh. So we can equivalently treat the members of our further 
reduced G-kH as pairs (g, h ) belonging to G x 77. Equip this with the group 
operation x defined component-wise as before (in §6.3): this gives us an 
abelian group if G and 77 are. Take the obvious injections, g —A 1) and 
h (1 ,h). Then we claim [G x 77, 11 , 12 ] is a coproduct for the abelian 
groups G and 77. 

Take any abelian group I\ = ( K , *) and morphisms f±: G — ► K, f 2 : 77 — > 
K. Put v: GxH — > K to be the morphism that sends (g, h ) to fi(g)* f 2 (h). 
This evidently makes the coproduct diagram (with G x 77 for G*77) com- 
mute. And a similar argument to before shows that it is unique. 

So, in the case of abelian groups, the same objects can serve as both 
products and coproducts, when equipped with appropriate projections and 
injections respectively. 

(d) By duality, we immediately know lots of properties of coproducts. In par- 
ticular, we have the following composite theorem: 

Theorem 35. If both the coproducts [0 ,l i,l 2 \ and [O', t\, 4] exist for X andY 
in the category ‘rf, then there is a unique isomorphism f : O O' commuting 
with the injection arrows. 

And in a category with an initial object 0 and where the coproducts exist, 

( 1 ) 

(2) X ®Y = Y @ X 

(3) X © (Y © Z) “ (X © Y) © Z 

Further, the notion of a coproduct generalizes beyond the binary case, just as 
with products. Thus, exactly as you would expect, we have: 


57 



Products explored 


Definition 39. In any category the coproduct of the ^-objects X t (for indices 
i £ I) is an object O together with injection arrows ti : Xi —> O (for i £ I) such 
that for any object S and suite of arrows /, : Xi — > S , there is always a unique 

arrow v : O — > S such that /, = v o tj. (We can use ‘ ]Q ’ to notate such an O.) 

i&I 

By dual arguments to those we’ve met for products, nullary coproducts are initial 
objects and unary coproducts are objects equipped with its identity arrow. And 
we can then define what it is for a category to have all binary coproducts and 
all finite coproducts in the obvious ways, and show that a category has all finite 
coproducts iff it has an initial object and all binary coproducts. 
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Terminal and initial objects, products and coproducts, are defined by universal 
mapping properties. In this chapter, we look at another pair of cases, so-called 
equalizers and their duals. 


8.1 Equalizers 

It was useful, when defining products, to introduce the idea of a ‘wedge’ (Defn. 27) 
for a certain small configuration of objects and arrows in a category. Here’s a 
similar definition that is going to be useful in defining the equalizers: 

Definition 40. A fork (from S through X to Y) consists of arrows k: S — > X 
with f-.X—^Y and g: X — »• Y, such that f o k = g o k. <\ 

b / , 

So diagrammatically, a fork looks like this: S — — > X c Y, with the com- 
posite arrows from S to Y being equal. 

Now, a product wedge from O to X and Y is a limiting case, a wedge such 
that any other wedge from S to X and Y uniquely factors through it. Likewise, 
an equalizing fork from E through X to Y is another limiting case, a fork such 
that any other fork from an object S through X to Y uniquely factors through 
it (so this is another definition via a universal mapping property). That is to say 

Definition 41. Let Y? be a category and /, g : X — \ Y be a pair of parallel arrows 
in Y> . Then the object E and arrow e: E — > X form an equalizer in Y’ for those 

„ „ f , 

arrows iff/oe = goe (so E — - — ► X ) Y is indeed a fork), and for any 

. . ... „ 
fork S > X ( Y there is a unique mediating arrow u: S — > E such 

the following diagram commutes: 



<1 
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Note that, just as with products (see Defn. 28), we can give an alternative 
definition which defines equalizers in terms of a terminal object in a category of 
forks. First we say 

Definition 42. Given a category if and parallel arrows f,g: X — > Y, then the 

f 

derived fork category c ^f(xy) has as objects all forks S — ' > X 1 Y. And 

k k' 

an arrow from S — >■ • • • to S' — > • • • in ^ f(xy ) is a if-arrow g: S — ► S' such 
that the resulting triangle commutes: i.e. such that k = k 1 o g. 

k 

The identity arrow in ^ f(xy ) on the fork S — > ■ ■ ■ is the identity arrow lg 
in if; and the composition of arrows in c ^f(xy) is defined as the composition of 
the arrows as they feature in if. <1 

It is again easily checked that this indeed defines a category. Our definition of 
an equalizer then comes to this: 

Definition 43. An equalizer of /, g : X — > Y is some [E,e] such the resulting 
/ 

fork E — - — # X 1 Y is terminal in i ^f(xy)- <1 

Let’s immediately give some examples of equalizers: 

(1) Suppose in Set we have the parallel arrows X \ Y. Then consider 

the set E C X such that x £ E implies fx = gx, and let e : E — > X be the 
obvious inclusion map. We show that [E, e] is an equalizer for / and g. 

_ k f 

By construction, foe — joe. So suppose S — — > X ( Y is 

any other fork through f,g. Since f(k(s)) = g(k(s)) for each s € S, the 
set fcfS 1 ] = E: so defining the mediating arrow u: S — > E to agree with 
k : S — y X on all inputs will make the diagram for equalizers commute. 

It remains to show that this is the unique candidate for the function u. 
But note k = e o u, and e doesn’t change the values of the function (only 
its codomain), so k and u must indeed agree on all inputs. 

(2) Equalizers in categories whose objects are sets-with-structure behave sim- 
ilarly. Take the category Mon, for example. Given a pair of monoid ho- 

/ 

momorphisms (A,-) ) (Y, *), take the subset £ of X on which the 

functions agree. Evidently E must contain the identity element of X (since 
/ and g agree on this element: being homomorphisms, both must send it 
to the identity element of Y). And suppose e,e' £ E: then /(e • e') = 
/(e) * f{e') = g(e) * g(e') = g(e ■ e 7 ), which means that E is closed under 
products of members. 

So take E together with the monoid operation from (A, •) restricted to 
members of E. Then (E, •) is a monoid - for the identity element shared 
with (A, •) still behaves as an identity, and the operation is still associative. 
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And if we take (E,-) and equip it with the injection homomorphism into 

/ 

(A, •), this will evidently give us an equalizer for (A, •) 1 ( Y , *) . 

(3) Similarly, take Top. What is the equalizer for a pair of continuous maps 

/ 

X 1 Y ? Well, take the subset of (the underlying set of) X on which 

the functions agree, and give it the subspace topology. This topological 
space equipped with the injection into X is then the desired equalizer. 
(This works because of the way that the subspace topology is defined - we 
won’t go into details). 

(4) A special case. Suppose we are in Grp and have a group homomorphism, 
/: X — > Y . There is also another trivial homomorphism o: X —>■ Y which 
sends any element of the group X to the identity element in Y, i.e. is the 
composite X — > 1 — > Y of the only possible homonrorphisms. Now consider 
what would constitute an equalizer for / and o. 

Suppose K is the kernel of /, i.e. the subgroup of X whose objects are the 
elements which / sends to the identity element of Y, and let i : K — ► X be 

/ 

the inclusion map. Then K — - — > X 1 Y is a fork since foi = ooi. 

f 

Let S — — > X ) Y be another fork. Now, o ok sends every ele- 
ment of S to the unit of Y. Since / o k = oo k, k must send any element of 
S to some element in the kernel K. So let k ! : S K agree with k : S — > X 
on all arguments. 

Then the following commutes: 

S 


I< 

And evidently k! is the only possible homomorphism to make the diagram 
commute. 

So the equalizer of / and o is /’ s kernel K equipped with the inclusion 
map into the domain of A. Or putting it the other way about, we can 
define kernels of group homorphisms categorially in terms of equalizers. 

_ ... „ . / 

(5) Finally we remark that the equalizer of a pair of maps A 1 Y where 

in fact / = g is simply [A, lx]- 

Consider then a poset (P,=4) considered as a category whose objects 
are the members of P and where there is a unique arrow A — » Y (for 
A, Y £ P) iff A ^ Y. So the only cases of parallel arrows from A to A are 
cases of equal arrows which then, as remarked, have equalizers. So in sum, 
a poset category has all possible equalizers. 
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8.2 Uniqueness again 

Just as products are unique up to unique isomorphism, equalizers are too. That 
is to say, 

Theorem 36. If both the equalizers [E,e] and \E' ,e'\ exist for X j 1 Y, 

then there is a unique isomorphism j : E E' commuting with the equalizing 
arrows, i.e. such that e = e' o j. 

Plodding proof from first principles. We can use an argument that goes along 
exactly the same lines as the one we used to prove the uniqueness of products and 
equalizers. This is of course no accident, given the similarity of the definitions. 

Assume [E, e) equalizes / and g, and suppose eoh = e. Then observe that the 
following diagram will commute 

E 

h 


E 

So h = 1 E makes that diagram commute. But by hypothesis there is a unique 
arrow E — » E which makes the diagram commute. So we can conclude that if 
e o h = e, then h = 1 E . 

Now suppose [E\ e'} is also an equalizer for / and g. Then [E, e] must factor 
uniquely through it. That is to say, there is a (unique) mediating j : E — > E' 
such that do j = e. And since [E,e\ must factor uniquely though [E' ,e'\ there 
is a unique k such that e o k = e' . So e o k o j = e, and hence by our initial 
conclusion, koj = \ E . 

A similar proof shows that j o k = \ E ' ■ Which makes the unique j an isomor- 
phism. □ 

Proof using the alternative definition of equalizers. [E,e] and \E',e'] are both 
terminal objects in the fork category ^(xy)- So by Theorem 14 there is a 
unique ^F(xy)-isomorphism j between them. But, by definition, this has to be 
a Harrow j : E E' commuting with the equalizing arrows. And j is easily 
seen to be an isomorphism in Y; too. □ 

Let’s add two further general results about equalizers. First: 

Theorem 37. If [E, e] constitute an equalizer, then e is a monomorphism. 

r„ , . / 

Proof. Assume [E, e] equalizes X j Y, and suppose e o g = e o h, where 

„ 9 , 

D ) E. Then the following diagram commutes, 
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D 

a h 


E 

f 

So D > X l Y is a fork factoring uniquely through the equalizer, and 

hence there is only one arrow to do the factoring, i.e. g = h. So e is left-cancellable 
in the equation e o g = e o h; i.e. e is nronic. □ 

Second, in an obvious shorthand, 

Theorem 38. In any category, an epic equalizer is an isomorphism 

r _ / 

Proof. Assume again that [E, e] equalizes X l Y, so that / o e = g o e. So 
if e is epic, it follows that / = g. Then consider the following diagram 
X 

X L 4 Y 

E " 

Because e equalizes, we know there is a unique u such that (i) eo u = lx- 
But then also eo(uoe) = lx ° e = e = e o 1 E . Hence, since equalizers are 
mono by the last theorem, (ii) u oe = 1 E . 

Taken together, (i) and (ii) tell us that e has an inverse. Therefore e is an 
isomorphism. □ 

8.3 Subsets and subobjects 

(a) In §8.1, we saw that in Set, given two parallel arrows from an object X, 
a certain subset of X (together with the trivial inclusion function) provides an 
equalizer for the arrows. And §8.2 tells us that this is the unique equalizer, up 
to isomorphism. 

We now note that a reverse result holds too: 

Theorem 39. In Set, any subset S of X together with its natural inclusion map 
i: S — > X form an equalizer for certain parallel arrows from X. 

Proof. A subset S Cl has an associated characteristic function s: X — > {0, 1} 
which sends x G X to 1 (‘true’) if x € S and to 0 otherwise. 

Another function from X to {0, 1} is the completely indiscriminate map c 
which sends everything in X to 1. 

We now show that [S, i\ is an equalizer for X X {0, 1}. 
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First, it is trivial that so* = coi, so as required S — — > X l {0, 1} is 
indeed a fork. 

So it remains to show that any upper fork in this next diagram factors through 
the lower fork via a unique mediating u: 

R 

u\ x ; \ { 0 , 1 } 

s 

Since so/ = co/by assumption, it is immediate that f[R] C S. Hence, if 
we define u : R — > S to agree with / : R — > X on all inputs, then the diagram 
commutes. And this u is evidently the only possible candidate. □ 

(b) Since [S, i] is the unique equalizer for X ] {0, 1} up to isomorphism, 
this gives us a nice categorial way of characterizing subsets in Set as equalizers. 

Here’s a variant way of putting the same thought, taking a step towards further 
generalization. First, we remind ourselves that using the set {0, 1} as the set of 
truth- values is of course arbitrary. So let’s now use Cl to denote some appropriate 
set. And let t: 1 — > Cl then be the map that picks out the truth- value ‘true’ from 
Cl. A characteristic function for a subset of X can now be treated as an arrow 
X ^ Cl. 

Note next that the map coi: S — > Cl , which sends everything in S to the value 
‘true’, is trivially equal to composite map 

S ■ > 1 > Cl 

with 1 a terminal object in the category. Similarly for the map co f: R—^ Cl. 

Now let us generalise by thinking not in terms of a strict inclusion map i but in 
terms of a monomorphism *: S >— > X which ‘injects’ S into X. Then, massaging 
our earlier diagram, the claim that [S, i] is tantamount to a subset of X comes 
to this. There is a map s : X — > Cl (intuitively, the characteristic function for the 
subset S) such that for any f:R—tX such that outer (bent) square commutes, 
there is unique u which makes the whole diagram commute: 
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Now, this is perhaps not as transparent as our first way of putting things. But 
the attraction is that this categorial characterization of subsets in Set can be 
ported to given a useful general definition of subobjects in other categorties. 
More about all this in due course. 

8.4 Co-equalizers 

(a) We dualize our definition of an equalizer to get the notion of a co-equalizer. 
So, as a preliminary, we say 

Definition 44. A co-fork (from X through Y to S ) consists of parallel arrows 
/ : X — ► Y, g: X — »• Y and an arrow k : Y —> S, such that k o f = k o g. <1 

(Actually, plain ‘fork’ is used for the dual too: but the barbarous ‘co-fork’ keeps 

, f , h 

things clear.) So diagrammatically, a co-fork looks like this: X ( Y > S, 

with the composite arrows from X to S being equal. 

Definition 45. Let be a category and f:X^Y and g: X — ► Y be a pair of 
parallel arrows in Y 1 . Then the object C and arrow c: Y — » S form a co-equalizer 
in for those arrows iff c o / = c o g, and for any co-fork from X through Y to 
S there is a unique arrow u : C — > S such the following diagram commutes: 



As we would now expect, by a dual argument, co-equalizers are unique up 
to a unique isomorphism; and since equalizers are monic, co-equalizers are epic. 
We won’t pause to spell out those results, but turn immediately to give just one 
central example. 

(b) What do co-equalizers look like? In Set, the parallel arrows f,g: X — ► Y 
determine a relation R on the members of Y where y\Ry2 holds when there is 
an x £ X such that /( x) = yi A g(x) = 3/2 ■ 

f 

Now if there is a co-fork X ( Y — S, then if y\Ry2 then k(y 1) = 

k (2/2) (and of course, being mapped to the same value by k is an equivalence 
relation). Having equal A;- values is an equivalence relation. So, for every co-fork, 
there is a corresponding equivalence relation such that being A-related implies 
being in that equivalence relation. What’s the limiting case of such an equivalence 
relation? It will have to be 7 ? , the smallest equivalence relation containing R. 
That observation gives us the clue about how to proceed, still working in Set. 
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Theorem 40. Given functions f,g:X^Y in Set, let R~ be the smallest equiv- 
alence relation containing R - where y\Ry 2 iff (3a; € X)(f(x) = y\/\g(x) = 2/2) - 
Let C be Y/R~, i.e. the set of R~ -equivalence classes of Y ; and let c map 
y £ Y to the R~ -equivalence class containing y. Then [ C,c ], so defined, is a 
co-equalizer for f and g. 

Proof. First we need to check that c o / = c o g. But the left-hand side sends 
x £ X to the R~ -equivalence class containing f(x) and the right-hand side sends 
x to the -equivalence class containing g(x). However, f(x) and g(x) are by 
definition if-related, and hence are i?~-related: so by construction they belong to 

/ 

the same i?~ equivalence class. Hence X' <• Y — > C is indeed a co-fork. 

g^ 

f , t 

Now suppose there is another co-fork X ( Y — — > S We need to show 

the first co-fork will factor through this via a unique mediating arrow u. 

By assumption, k o / = / o g. And we first outline a proof that if y\R~ y 2 then 

Hyi) = k(y 2 ). 

Start with R defined as before, and let R' be its reflexive closure. Obviously 
we’ll still have that if y\R!y 2 then k(yi) = k(y 2 ). Now consider R" the symmetric 
closure of R': again obviously, we’ll still have that y\R"y 2 then k(yf) = k{y 2 ). 
Now note that if y\R"y 2 and y 2 R"ys, then k{yf) = So if we take the 

transitive closure of R" , we’ll still have a relation which, when it holds between 
some 2/1 and y 2 , implies that k{yf) = k(y 2 ). But the transitive closure of R" is 
R~. 

We have shown, then, that k is constant on members of a i?~-eciui valence class, 
and so we can well-define a function u: C — > S which sends an equivalence class 
to the value of k on a member of that class. This u is the desired mediating arrow 
which makes the diagram defining a co-equalizer commute. Moreover, since c is 
surjective and C only contains iZ~-equivalence classes, u is the only function for 
which u o c — k. □ 

In a slogan then: in Set, quotienting out by an equivalence relation is (up 
to unique isomorphism) the same as taking an associated co-equalizer. In many 
other categories co-equalizers behave similarly, corresponding to ‘naturally oc- 
curring’ quotienting constructions. But we won’t go into more detail here. 
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A terminal object is defined in terms of how all other objects in the category 
relate to it (by each sending it a unique arrow). A product wedge is defined in 
terms of how all other wedges in a certain family relate to it (each factoring 
through it via a unique arrow). An equalizing fork is defined in terms of how 
all other forks in a certain family relate to it (each factoring through it via a 
unique arrow). In an informal sense, terminal objects, products, and equalizers 
are limiting cases, defined in closely analogous ways. Likewise for their duals. 

In this chapter, we now formally capture what’s common to terminal objects, 
products and equalizers by defining a general class of limits, and confirming that 
terminal objects, products and equalizers are indeed examples. We also define a 
dual class of co-limits, which has initial objects, coproducts and co-equalizers as 
examples. We then give a new pair of examples, one for each general class, the 
so-called pullbacks and pushouts. 

9.1 Defining limit cones 

(a) Way back in Defn. 3, we characterized a diagram I? in a category as 
being simply a bunch of objects with some arrows between some of them. We 
need some way of indexing these objects to refer to them, so henceforth we’ll 
refer to objects in D by terms like ‘ D :) ' where j is in some appropriate index 
set. And for convenience, we’ll allow double counting, permitting the case where 
Dj = Dk for different indices. We will also allow the limiting cases where there 
are no arrows, and even the empty case where there are no objects. 

Definition 46. Let I? be a diagram in category . Then a cone over D comprises 
a ^-object C, the vertex of the cone, together with ^-arrows c j : C -» Dj, one 
for each object Dj in D, such that whenever there is an arrow d: Dk — > Di in 
D, the following diagram commutes: 


C 



We use ‘[C, Cj}’ as our notation for such a cone. <1 
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Think of it diagrammatically(!) like this: arrange the objects in the diagram D 
in a plane, along with whatever arrows there are between them in D. Now sit 
the object C above the plane, with a quiverful of arrows from C zinging down, 
one to each object Dj in the plane. Those arrows form a skeletal cone. And 
the key requirement is that any triangles thus formed with C at the apex must 
commute. 

We should note, by way of aside, that some authors prefer to say more 
austerely that a cone is not a vertex-object-with-a-family-of-arrows-from-that- 
vertex but simply a family of arrows from the vertex. Since we can read off the 
vertex of a cone as the common source of all its arrows, it is very largely a matter 
of convenience whether we speak austerely or explicitly mention the vertex. But 
for the moment, we’ll take the less austere line. 

There can be many cones, with different vertices, over a given diagram D. 
But, in the same spirit as our earlier definitions of products and equalizers, we 
can define a limiting case, by means of a universal mapping property: 

Definition 47. A cone over a diagram D in ^ is a limit (cone) over D iff 

any cone [ C,Cj ] over D uniquely factors through it, so there a unique mediating 
arrow it: C —t L such that for each index j, itjou = Cj. In other words, for each 
Dj , TV Di, . . . , in D, the corresponding triangle with other vertices C and T 
commutes: 


C 



(b) Let’s immediately confirm that our three announced examples of limits so 
far are indeed limit cones in the sense just defined. 

(1) We start with the null case. Take the empty diagram in V - zero objects 
and so, necessarily, no arrows. Then a cone over the empty diagram is 
simply an object C, a lonely vertex (there is no further condition to fulfil), 
and an arrow between such cones is just an arrow between objects in V. 
Hence T is a limit cone just if there is a unique arrow to it from any other 
object i.e. just if T is a terminal object in V! 

(2) Consider now a diagram which is just two objects we’ll call ‘TV, ‘TV, still 
with no arrow between them. Then a cone over such a diagram is just a 
wedge into TV TV and a limit cone is simply a product of Di with TV 
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(3) Next consider a diagram which again has just two objects, but now with 

d 

two parallel arrows between them, which we can represent D \ < D 2 . 

d! 

Then a cone over this diagram is a commuting diagram like this: 


C 



If there is such a diagram, then we must have doci = d! oc\ \ and vice versa, 
if that identity holds, then we can put C 2 = d o ci = d' o c\ to complete 
the commutative diagram. Hence we have a cone from the vertex C to our 

l / Cl t/ ^ 1 1 

diagram iff C > D\ < £>2 is a fork. Since C\ fixes what C 2 has 

d' 

to be to complete the cone, we can focus on the cut-down cone consisting 
of just [C, Ci]. 

What is the corresponding cut-down limit cone? It consists in [E, e] such 
there is a unique u such that ci = e o u. Hence [E, e] is an equalizer of 
d 

Di =£ D 2 . 

d! 

(c) We can now give a direct proof, along now familiar lines, for the predictable 
result 

Theorem 41. Limit cones over a given diagram D are unique up to a unique 
isomorphism commuting with the cones’s arrows. 

Proof. As usual, we first note that a limit cone [L, 7ij] factors through itself via 
the mediating identity 1l- L — > L. But by definition, a cone over D uniquely 
factors through the limit, so that means that 

(i) if 7r j o u = 7 Tj for all j, then u = 1 l- 

Now suppose [I/,7r'] is another limit cone over D. Then [L' . 7r'] uniquely factors 
through [L,-Kj\, via some /, so 

(ii) 7T j o f = 7 Tj for all j. 

And likewise [L,TTj\ uniquely factors through [L 1 , 7r'] via some g , so 

(iii) TTjOg = 7 Tj for all j. 

Whence 

(iv) 7r j o / o g = 7 Tj for all j. 

Therefore 
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(v) fog = 1 L . 

And symmetrically 

(vi) go f = 1 L ,. 

Whence / is not just unique (by hypothesis, the only way of completing the 
relevant diagrams to get the arrows to commute) but an isomorphism. □ 

9.2 Limit cones as terminal objects 

We have already seen that 

(1) A terminal object in Y is .. . wait for it! ... terminal in the given category 

7f. 

(2) The product of X with Y in Y is a terminal object in the derived category 
c Yw(x.y) of wedges to X and Y. 

(3) The equalizer of parallel arrows through X to Y in Y are (parts of) terminal 
objects in the derived category Y F ( X y) of forks through X to Y. 

Predictably, limit cones more generally are terminal objects in appropriate cat- 
egories. 

To spell this out, we first note that the cones [C, Cj\ over a given diagram D 
in Y form a category in a very natural way: 

Definition 48. Given a diagram D in category Y, the derived category Yq(d) ~ 
the category of cones over D - has the following data: 

(1) Its objects are the cones [C,Cj\ over D. 

(2) An arrow from [ C,Cj ] to [C . c' ] is any Harrow f:C — ► C' such that 

c 'j° f — c j for all indices j. In other words, for each Dj, D k , in D, 

the corresponding triangle with remaining vertices C and C' commutes: 

G 



The identity arrow on a cone [C, cf\ is the Y -arrow 1 q. And composition for 
arrows in Yq{d) is just composition of the corresponding ^-arrows. <1 

It is entirely routine to confirm that Yq(d) is indeed a category. We can then 
recast our earlier definition of a limit cone as follows: 
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Definition 49. A limit cone for D in If is a cone which is a terminal object in 
^ C(D )■ < 

And we now have an alternative proof of our last theorem, that limit cones 
over a given diagram D are unique up to unique isomorphism commuting with . 

Proof. Since a limit cone over D is terminal in ^ 0 ( 0 ), it is unique in Ifcyo) 
up to a unique isomorphism. But such an isomorphism in ^c{D) must be an 
isomorphism in If commuting with the cones’s arrows. □ 

9.3 Results about limits 

(a) Let’s first prove two further simple theorems: 

Theorem 42. Suppose [ L , ttj] is a limit cone over a diagram D in If, and [V , 7 r'] 
is another cone over D which factors through [L,ttj\ via an isomorphism f. Then 
[I/,7r'-] is also a limit cone. 

Proof. Take any cone [ C , Cj] over D. We need to show that (i) there is an arrow 
v \ G I! such that for all indexing objects j in D , Cj = 7 r' o v, and (ii) v is 
unique. 

But we know that there is a unique arrow u : C — » L such that for j, Cj = ttjou. 
And we know that / : L' — > L and 7r' = ttj o / (so ttj = 7r' o / -1 ). 

Therefore put !) = / _1 o«, and that satisfies (i). 

Now suppose there is another arrow v l : C — > Ii such that Cj = 7r' o v'. Then 
we have fov' : C — > L, and also Cj = ttjo f ov’ . Therefore [C, cf\ factors through 
[. L , 7 tj\ via fov 1 , so / ov' = u. Whence v' = f~ x o u = v. Which proves (ii). □ 

Theorem 43. Suppose [L,7Tj] is a limit cone over a diagram D in . Then the 
cones over D with vertex C correspond one-to-one with & -arrows from C to L. 

Proof. Take any arrow u : C — > L. If there is an arrow d : Dk — > Di in the diagram 
D , then (since [L, ttj] is a cone), 717 = d o 717 ,, whence (ttiou) = do ( 7 1 }~ o u). Since 
this holds generally, [ C , tt j oh] is a cone over D. But (again since [L, 7 r,-] is a limit) 
every cone over D with vertex C is of the form [C, 7 tj o u] for unique u. Hence 
there is indeed a one-one correspondence between arrows u: C — > L and cones 
over D with vertex C. (Moreover, the construction is a natural one, involving 
no arbitrary choices.) □ 

(b) As a fun exercise and reality check, let’s remark that the whole category 
can be thought of as the limiting case of a diagram in itself, and then 

Theorem 44. A category ‘rf has an initial object if and only if , thought of as 
a diagram in , has a limit. 
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Proof. Suppose 7 ? has an initial object I. Then for every ^-object C, there is a 
unique arrow 7 tq- [I, ^ re] is a cone (since for any arrow f : C D, the composite 
/ o 7 tc is an arrow from I to D and hence has to be equal to the unique 7 r£>. 
Further, [I, ttc] is a limit cone. For suppose [ A , ac\ is any other cone over the 
whole of ^ . Then since it is a cone, the triangle 


A 


I 




■> C 


has to commute for all C. But that’s just the condition for [A, ac] factoring 
through [I, ttc] via a/. And moreover, suppose [A, ac] also factors through by 
some u. Then in particular, 


A 



commutes, and so u = ac- So the factoring is unique, and [I, ttc] is a limit cone. 

Now suppose, conversely, that [/, irc\ is a limit cone over the whole of . Then 
there is an arrow 7 tq ■ I — t C for each C in . If we can show it is unique, / will 
indeed be initial. 

Suppose then that there is an arrow k: I — > C for a given C. Then since 
[/, 7 rc] is a cone, the diagram 


I 



1 k 

has to commute. Considering the case where k = nc , w e see that [I,ttc\ factors 
through itself via 7 17; but it also factors via Id, so the uniqueness of factorization 
entails nj = Id- Hence the diagram shows that for any k: I —> C has to be 
identical to nc- So I is initial. □ 


(c) Before proceeding further, let’s introduce some standard notation: 

Definition 50. We denote the limit object at the vertex of a limit cone for the 
diagram D with objects Dj by l LimDj\ <\ 

Do note, however, that since limit cones are only unique up to isomorphism, 
different but isomorphic objects can be denoted in different contexts by l Lim DP. 

-t—J 

The projection arrows from this limit object to the various objects Dj will 

then naturally be denoted ‘7 : Lim Dj —> Di\ and the limit cone could therefore 

4-3 
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be represented by l [Lim Dj , 7 r,-]’. (The direction of the arrow under L Lim' in this 
*~3 

notation is perhaps unexpected, but we just have to learn to live with it.) 

9.4 Colimits defined 

The headline, and thoroughly predictable, story about duals is: reverse the rel- 
evant arrows and you get a definition of colimits. 

So, dualizing §9.1 and wrapping everything together, we get: 

Definition 51. Let D be a diagram in category ^ . Then a cocone under D is 
a ^-object C, together with an arrow Cj : Dj — ► C for each object Dj in D, 
such that whenever there is an arrow d: D k — > Di in D, the following diagram 
commutes: 


Dk 



> Di 



The cocones under D form a category with objects the cocones [ C,Cj ] and an 
arrow from [ C,Cj ] to [C',c'] being any Harrow /: C C' such that c' = / oc, 
for all indexes j. A colimit for D is an initial object in the category of cocones 
under D. It is standard to denote the object at the vertex of the colimit cocone 
for the diagram D by l LimDj\ <\ 

It is now routine to confirm that our earlier examples of initial objects, co- 
products and co-equalizers do count as colimits. 

(1) The null case where we start with the empty diagram in gives rise to a 
cocone which which is simply an object in C. So the category of cocones 
over the empty diagram is just the category ^ we started with, and a limit 
cocone is just an initial object in c tf\ 

(2) Consider now a diagram which is just two objects we’ll call ‘Hi’, l D 2 \ still 
with no arrow between them. Then a cocone over such a diagram is just 
a corner from D\,D 2 (in the sense we met in §7.6); and a limit cocone in 
the category of such cocones is simply a coproduct. 

(3) And if we start with the diagram D\ \ D 2 then a limit cocone over 

d' 

this diagram gives rise to a co-equalizer. 


9.5 Pullbacks 

(a) Let’s illustrate all this by briefly exploring another kind of limit (in this 
section) and its dual (in the next section). 
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A co-wedge or, as I prefer to say, a corner D in category ^ is a diagram which 
can be represented like this: 

D 3 

l e 

D 1 D 2 


Now, a cone over our corner diagram has a rather familiar shape, i.e. it is a 
commutative square: 


c — ' 3 > d 3 



Di — » D 2 


Though note, we needn’t really draw the diagonal here, for if the sides of the 
square commute thus ensuring d o c\ = e o c 3 , then we know the diagonal c 2 
exists making the triangles commute. 

And a limit for this type of cone will be a cone with vertex L = Lim Dj and 

<~3 

projections 7 r j : L — > Dj such that for any cone [C, Cj] over D, there is a unique 
u: C — > L such that this whole diagram commutes: 



(And note that if this commutes, there’s just one possible tt 2 : L D 2 and 
c 2 : C — » D 2 which makes the diagram still commute.) 

Definition 52. A limit for a corner diagram is a pullback. The whole square 
formed by the original corner and its limit is a pullback square. <3 


(b) Let’s immediately have some examples of pullback squares living in the 
category Set. 

(1) Changing the labelling, consider a corner comprising three sets X, Y, Z and 
a pair of functions which share the same codomain, thus: 

Y 
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We know from the previous diagram that the limit object L must be 
product-like (with any wedge over X, Y factoring through the wedge with 
vertex L). Hence to get the other part of the diagram to commute, the 
pullback square must have at its apex L something isomorphic to { (or, y) £ 
X x Y | f(x) = g{y)} with the obvious projection maps to X and Y. 

So suppose first that in fact both X and Y are subsets of Z, and the 
arrows into Z are both inclusion functions. And we then get a pullback 
square 


L > Y 

i I* 

X — — > Z 

Then L = {(x,y) £ X x Y \ x = y} = {(z, z) \ z £ XnY}= XnY. Hence, 
in Set, the intersection of a pair of sets is their pullback object (fixed, as 
usual, up to isomorphism). 

(2) Take another case in Set. Suppose we have a corner as before but with 
Y = Z and g = lz- Then 

L = {{x, z) £ X x Z | f(x) = z} = {x | 3zf(x) = z} = / _1 [Z], 

i.e. a pullback object for this corner is, up to isomorphism, the inverse 
image of Z , and we have a pullback square 

f~\Z\ * Z 

1 z 

4 4 

X > Z 

Hence in Set, the inverse image of a function is also a pullback object. 

(3) Revisiting §8.3, we see that we in effect defined a subset SC I in Set as 
a pullback object for a corner 


1 

t 

X ► {0, f } 

(c) Why ‘pullback’? Look at e.g. the diagram in (2). We can say that we get 
to / _1 [Z] from Z by pulling back along / - or more accurately, we get to the 
arrow f~ l \Z\ — > X by pulling back the identity arrow on Z along /. 

In this sense, 

Theorem 45. Pulling back a monomorphism yields a monomorphism 
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In other words, if we start with the same corner X — - — > Z < — - — Y with g 
rnonic, and can pullback g along / to give a pullback square 

L b - > Y 

a g 

i i 

x — - — > z 

then the resulting arrow a is monic. (Note, this does not depend on the character 

of /•) 

Proof. Suppose, for some arrows C \ L, a o j = a o k. Then g o b o j = 

k 

f o a o j = foaok = gobok. Hence, given that g is monic, bo j = b o k. 

It follows that the two cones over the original corner, X < a ° 3 (j; C b ° J > Y 

and X < aok C bok > Y are in fact the same cone, and hence must factor 
through the limit L via the same unique arrow C — > L. Which means j = k. 

In sum, a o j = a o k implies j = k, so a is monic. □ 

Here’s another result about monomorphisms and pullbacks: 

Theorem 46. The arrow f : X Y is a monomorphism in Y? if and only if the 
following is a pullback square: 

X > X 

lx / 

X — > Y 

Proof. Suppose this is pullback diagram. Then any cone X — C — - — > Y 

f f 

over the corner X > Y < X must uniquely factor through the limit 

with vertex X. That is to say, if foa = fob , then there is a u such that a = 1 x°u 
and 6 = I.y ° u, hence a = b so / is monic. 

Conversely, if / is monic, then given any cone X < — - — C — - — f Y over the 
original corner, / o a = / o b, whence a — b. But that means the cone factors 
through the cone X ^ — X lx > X via the unique a, making that cone a 
limit and the square a pullback square. □ 

(d) We’ve explained, up to a point, the label ‘pullback’. It should now be noted 
in passing that a pullback is sometimes called a fibered product (or fibre product) 
because of a construction of this kind on fibre bundles in topology. Those who 
know some topology can chase up the details. 
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But here’s a way of getting products into the story, using an idea that we 
already know about. Recall the definition of the slice category fZ. Its objects 
are arrows^ / : C — > Z where C is an object in ^ 7 , and an arrow from / : X — ► Z 
to g: Y — > Z is an arrowy h: X -^-Y such that / = g o h in . 

Now the pullback of the corner formed by / and g in ^ is a pair of arrows 
a: L — » X and b: L -* Y such that f o a = g o b (= k) and which form a wedge 
such that any other wedge a' : L' — ► X , b' : L' — ► Y such that foa! = gob' (= k') 
factors uniquely through it. 

Looked at as a construction in c to /Z : this means taking two 'tf/Z - objects / 
and g and getting a pair of *€ I Z - arrows a: k f, b: k g (check that a is 
indeed a ^ I Z - arrow from foa to /!). And this pair of arrows forms a wedge 
such that any other wedge a' : k' — > f, b' : k! — > g factors uniquely through it. In 
other words, the pullback in ^ is a product in ^ fZ. 

Product notation is often used for pullbacks, thus: 



with the little corner conventionally indicating it is indeed a pullback square 

9.6 Pushouts 

As you would expect, the account of pullbacks dualizes. Suppose we take a 
wedge D , i.e. a diagram like this: D\ — — > D3. A cocone under 

this diagram is another commutative square (omitting again the diagonal arrow 
which is fixed by the others). 



And a limit cocone of this type will be a cocone with apex L = Lim Dj and 

projections 7 tj : L — > Dj such that for any cocone [G, Cj] under D , there is a 
unique u: L — » C such that the obvious dual of the whole pullback diagram 
above commutes. 

Definition 53. A limit for a wedge diagram is a pushout. <\ 

Now, in Set, we get the limit object for a corner diagram X — - — > Z <— - — Y 
by taking a certain subset of a product X x Y . Likewise we get the colimit object 

for a wedge diagram X < — — Z — - — > Y by taking a certain quotient of a 
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coproduct XHY. (Recall from our discussion of (co)-equalizers that quotients 
are categorially dual to subsets in Set.) In fact, we need to quotient out by the 
smallest equivalence relation containing the relation R where (x, 0) R(y, 1) iff 
there is a 2 such that f(z) = x A g(z) = y. We won’t, however, pause further 
over this now. Though it does illustrate how taking colinrits can tend to beget 
messier constructions than taking limits. 
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We have seen that a whole range of familiar constructions from various areas of 
ordinary mathematics can be regarded as instances of taking limits or colimits 
of (very small) diagrams in appropriate categories. Examples so far include: 
forming cartesian products or logical conjunctions, taking disjoint unions or free 
products, quotienting out by an equivalence relation, taking intersections, taking 
inverse images. 

Not every familiar kind of ‘universal’ construction involves taking (co)limits. 
We’ll see in the next chapter that exponentiation involves a different idea. But 
plainly we are mining a very rich seam here (and also beginning to make good on 
our promise to show how category theory helps reveal recurring patterns across 
different areas of mathematics). 

It would get tedious, however, to explore what it takes for a category to have 
limits for various further kinds of diagram case-by-case, even if we just stick to 
considering limits over very small diagrams. But fortunately we don’t need to 
do such a case-by-case examination. It turns out that if a category has certain 
basic limits, then it has, in a sense to be clarified, all limits. 

10.1 Pullbacks, products and equalizers related 

(a) Here’s an obvious definition: 

Definition 54. The category has all finite limits if for any finite diagram D 
- i.e. for any diagram whose objects are Dj for indices j € /, where / is a finite 
set - 'rf has a limit over D. A category with all finite limits is said to be finitely 
complete. <3 

Our main target theorems for this chapter are then as follows: 

Theorem 47. If has a terminal object, and has all binary products and equal- 
izers, it is finitely complete. 

Theorem 48. If has a terminal object, and has a pullback for any corner, it 
is finitely complete. 
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Later, we will see how to get an analogous result for limits over infinite diagrams; 
but it will help fix ideas if we initially focus on the finite case. And of course, 
our theorems will have the predictable duals: we will return to them in §10.5. 

We begin, in this section, by proving the following much more restricted ver- 
sions of these theorems: 

Theorem 49. If a category Y? has binary products and equalizers, then it has a 
pullback for any corner. 

Theorem 50. If# has a terminal object, and has a pullback for any corner, then 
it has all binary products and hence all equalizers. 

This will have a double pay-off. First, it means that we only need prove one of 
Theorems 47 and 48, since in the presence of Theorems 49 and 50 they imply 
each other: we will concentrate on proving Theorem 47. Second, our proof of 
Theorem 49 will provide a very instructive guide to how to do that. 

(b) For those rather nobly trying, as we go along, to prove stated theorems 
before looking at the proofs, the results in this chapter do require a little more 
thought than what’s gone before. Though a little exploration should reveal the 
only reasonable proof-strategies. 

Proof for Theorem 49- Given an arbitrary corner X — - — > Z < — - — Y we need 
to construct a pullback. 

There is nothing to equalize yet. So our only option is try products. Because Y 
has binary products, there will in particular be a product X x Y with the usual 
projections 7Ti : XxY — > X and 7Ti : X x Y — »• Y . But that this immediately gives 

/°7Tl, ^ _ 

us parallel arrows XxY ] Z. Because has equalizers, this parallel pair 

9 °7T2 

must have an equalizer [E, e]. So the square and its two internal triangles in the 
following diagram all commute: 



Claim: the wedge formed by E with the projections ni o e, n 2 o e is indeed a 
pullback of the original corner. 


80 



10.1 Pullbacks, products and equalizers related 


From this point, the argument is just fairly routine checking. Consider any 
other cone over the original corner, i.e. consider any wedge X -e— ^ — C — — — > Y 
with fc\ = gc 2 . We need to show that this factors uniquely through E. 

Now, that wedge certainly uniquely factors through the product X x Y, so 
there is a unique u: C — > X x Y such that ci = o u, C 2 = ^2 ° u. Hence 
„ „ / 07r 1, „ 

/ o 7n o u = go'K 2 °u. Therefore C > X x Y ( Z is a fork, which must 

gon 2 

factor uniquely through the equalizer E via some v. 

That is to say, there is a v : C — > E such that e o v = u. Hence 7Ti o e o v = 
7Ti o u = ci. Similarly tt 2 o e o v = C 2 . Therefore the wedge with vertex C indeed 
factors through E, as we need. 

To finish the proof, we have to establish the uniqueness of the mediating 
arrow v. Suppose v' : C — > E also makes 7Ti o e o v' = c\, ir 2 o e o v' = C 2 - 
But e o v' : C — > X xY, and we know the wedge X ^ — — C — > Y factors 
uniquely through XxY,soeov' = u = eov. But equalizers are rnonic, so 
v' — v. □ 

(c) Next, to prove Theorem 50 on getting products from pullbacks, think first 
about a special case oi XxzY. Then, to prove the theorem’s claim about getting 
equalizers from products and pullbacks, try reconfiguring the parallel arrows you 
want to equalize as a wedge; take the product this wedge factors through, and 
use the fact that a certain corner involving this product has a pullback (the trick 
is to spot the right corner to use here). 

Spelling things out, we have: 

Proof for Theorem 50. Think about X Xj Y. In other words, note that since ^ 

has a terminal object, for any X, Y, there exists a corner X > 1 < Y. 

Applying the definition of a pullback, we find that a pullback for such a corner 
is indeed just the product X x Y with its usual projection arrows. 

To show that has equalizers, given that it has pullbacks and hence products, 

/ 

start by thinking of the parallel arrows we want to equalize, say X ( Y, as a 

wedge Y < — — X — — > Y. This wedge will factor uniquely via (/, g) through 
the product Y xY. 

So now consider the corner X ^’ 9 \ Y x Y e— 1 — Y, where Sy is the ‘diag- 
onal’ arrow (see Defn. 30). This is nice to think about since (to arm-wave a bit!) 
the left arrow is evidently related to the parallel arrows we want to equalize, and 
the right arrow does some equalizing. Take this corner’s pullback: 

E 1 > Y 

J 

e Sy 

X — — > Y x Y 
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Intuitively speaking, E — - — > X Y x Y sends something in if to a pair 

of equals. So, morally, [E,e] ought to be an equalizer for X ) Y. And, 

from this point on, it is a routine proof to check that it indeed is an equalizer. 
Here goes: 

By the commutativity of the pullback square, Sy ° q = (/, g) ° e. Appealing 
to Theorems 24, 26 and 27, it follows that (q,q) = (f o e,g o e), and hence 

e / , 

foe = q = goe. Therefore E — - — > X ( Y is a fork. It remains to show 
that it is a limit fork. 

Take any other fork C — - — > X ] Y. The wedge X < — - — C ^ > Y 

9 9 

must factor through E (because E is the vertex of the pullback) via a unique 
mediating arrow v: 



Rearranging part of this diagram, and unpacking again the composites foc,goc, 
this is just to say that there is a unique arrow v: C E making this cut-down 
diagram commute: 



So [E, e] is indeed an equalizer. 


□ 


10.2 Set has all finite limits 

Our target now is to show that if a category Yt has a terminal object and binary 
products (so has all finite products) and has equalizers, then it has a limit over 
any finite diagram D. 

This is indeed our first Big Result in the book. To prove it, we are going to 
generalize the strategy pursued in proving the cut-down Theorem 49 that showed 
that having binary products and equalizers implies at least having pullbacks. So: 
we start with a product P of objects from D. We then find some appropriate 
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parallel arrows out of this product P (here using arrows from D). Then we take 
an equalizer which will be a monic arrow from a subobject E of D, and use this 
subobject E as the vertex of the desired limit over D. 

The devil, of course, is in the details! And in fact, you won’t lose much if you 
skip past them. But assuming you want to understand the full proof, it might 
well help illuminate what’s going on in the abstract general proof if we first 
consider how the proof works for a special category, Set. For here we can rely 
on a concrete understanding of finite products as sets of finite tuples, and of 
equalizers as subsets. So in this section, we will establish 

Theorem 51. Set has all finite limits. 

In this case, our proof suggestion comes to this. Take a product of the objects 
in a given diagram D , which we can take to be a set of tuples equipped with the 
trivial projection arrows which pick out components of the tuples. Then, we will 
directly find E, a subset of the product set, which can be used as the vertex of 
our limit cone, keeping only those projection arrows that commute with arrows 
in D. Working that idea through, we have: 

Proof. We are given a finite diagram D in Set, whose objects are sets Dj for j £ I 
(where the index set / is finite, so we might as well assume it to be the first n 
natural numbers for some n). There are perhaps functions as arrows between 
some or all of these Dj. The task is to construct a limit over D. 

First take the finite product [P,Pj] of all the sets Dj in the standard way. 
So we can take P to be the set of all n-tuples x = (a where each tuple- 
component Xj is a member of the corresponding Dj. And then each function 
Pj : P — > Dj will project out the j-th element of a tuple, so Pj(x) = Xj. 

In general, however, [P,Pj\ won’t be a limit cone over D. For D may contain 
arrows between objects, and we’ve done nothing to ensure that the arrows pj 
appropriately commute with the arrows in D in the way required for a cone. 

Our announced strategy, then, is to look for a suitable subset of E C P. And 
if E is to be the vertex of a cone, we need to equip it with arrows 7 Tj : E — > Dj . 
Since E is a set of n-tuples of elements of the Dj , the natural candidates will 
still simply project out the elements, so for x € E, 7T ? (x) = x :r In effect we just 
cut down each pj from the domain P to the domain E. Or to put that more 
carefully, if e: E — > P is the obvious injection map here, then 7 Tj = pj o e. 

To get a cone, we need the relevant projection arrows to commute with any 
arrow in D. So if, in particular, we have d: Dk Di in d, then we require 
7q —do 7Tfc ; that is to say, for any tuple x € E, d(xk) = And of course we 
require a similar condition for each arrow in the diagram D. 

In summary so far, then, here’s one way of getting a cone [E,ttj\ over D: 

( 1 ) Put E to be the set of tuples x = (xj)j e i such that Xj £ Dj for all j & I 
and for every arrow d: D *. — > Di the corresponding condition d(xk) = Xi 
is satisfied. 
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(2) Put 7 Tj(x) = Xj (i.e. put 7 tj = pj oe, where e: £ — > P is the injection map). 

Since we’ve constructed this cone by cutting down another limit in the most 
natural and economical way possible, it morally ought to be a limit cone. 

Which it is! Suppose [ C,Cj\ is any cone over D. Define the map u: C — > E 
as sending c € C to the tuple (cj(c))j^i. Then Cj = 7r j o u and so [C, Cj\ factors 
through [E,nj] via u. And u is unique. For if [C,Cj] also factors through [E,nj\ 
via u', then 7 Tj o v! = Cj, therefore for any c the j-th component of v!(c) = Cj(c), 
hence u' = u. □ 

Note, since everything in that proof is finite it also establishes that FinSet also 
has all finite limits. 


10.3 The existence of finite limits, more generally 

The proof in the last section almost wrote itself once we had the key idea of first 
taking a product [P. Pj] of the objects of D , and then (pretty much by brute 
force) extracting a subset of E C P containing just those tuples which satisfy 
the condition necessary to get a cone with vertex E over D. However, our proof 
talked about tuples, subsets etc. in an old-school, non-categorial way. Let’s see 
if we can rethink the proof, continuing to work for the moment in Set but now 
putting things in more purely categorial terms. 

So again we take the product [P.pj\ of the objects of D (we can do this since 
Set has all finite products). And in headline terms, we know what we need to do 
next - we must get the subset E by defining it as an equalizer. But an equalizer 
for what? 

Take another look at the proof of Theorem 49. There we started with a corner 
diagram, i.e. with two arrows sharing a target, /: X — ► Z, g: Y — ► Z. We got 
parallel arrows which share a source as well as a target by taking a product, 

/°7T 

thereby getting X x Y ) Z. And then we could look for an equalizer. 

go-rr 2 

Now, in a diagram D there could be lots of arrows of the kind d: Dj. Di, 
and this time the arrows we need to deal with may well have different targets. 
But we still want to end up with a pair of parallel arrows with the same target 
if we are going to be able to take an equalizer. We’ll evidently have to go from 
many targets to a single target by taking a product again. So let’s define [Q, q{\ 
as the product of all the objects Di which are targets for arrows in D (again, we 
can do this since Set has all finite products). 

The name of the game is now to define a pair of parallel arrows 

V 

P Q 

w 

which we are going to equalize by some [E, e], and we aim to do this so that, in 
Set, E is the set of tuples which we met in the proof in the last section. 
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However, there are in fact only two naturally arising arrows from P to Q. 
Consider first the cone with vertex P and with an arrow pp. P Di for each 
Di which contributes to the product Q. This cone (by definition of the product 
[Q, qi ]) must factor through the product by a unique mediating arrow v. so that 
Pi = qi ° v for each l. 

Consider secondly the cone with vertex P and an arrow d o p k : P — y Di for 
each arrow d: D k — > £)/ in D. This cone too must factor through the product 
[Q, qi] by a unique mediating arrow w, so that d o p k = qi o w for each arrow 
d : D k — > Di . 

Since all parallel arrows have equalizers in Set, we can take the equalizer of v 
and w, namely [E, e]. Then E will be, as we want, the subset of tuples of objects 
from the Dj such that (for any arrow d: D k — > Di) d(x k ) = Xi. 

And now the big claim: as before, [E,pj o e] will be a limit cone over D. Let’s 
state this as a theorem: 


Theorem 52. Let D be a diagram in Set. Let [P, pj) be the product of the objects 
Dj in D, and [ Q , qi ] be the product of the objects Di which are targets of arrows 
in D. Then there are arrows 

V 

P = =£ Q 

w 

such that the following diagrams commute for each d: D k — > D\: 


P Q 

c<"\, J" 

Di 

Let the equalizer of v and w be [E, e] . 
D in Set. 


P 


Dk 


<5 


-»• D, 


Then [E, pj o e] will be a limit cone over 


Proof. We have already defined v and w so that the given diagrams commute. 
So next we confirm [E,pj o e] is a cone. Suppose there is an arrow d: D k — > D[. 
Then we reciuire d o p k o e = pi o e. 

But indeed dop k oe = qiowoe = qiovoe = pioe, where the inner equation 
holds because e is an equalizer of v and w and the outer equations are given by 
the commuting diagrams above. 

Second we show that [E,pj o e] is a limit. So suppose [C, cf] is any other cone 
over D. Then there must be a unique u: C P such that every Cj factors 
through the product and we have Cj = Pj o u. 

Since [ C,cj\ is a cone, for any d: D k —t Di in D we have d o c k = Cj. Hence 
dop k ou = piou, and hence for each qi , qiowou = qiovou. But then we can apply 
the obvious generalized version of Theorem 25, and conclude that w o u = v o u. 
Which means that 

C ^ — > P P l Q 

Q 
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is a fork, which must therefore uniquely factor through the equalizer [ E , e] . That 
is to say, there is a unique s: C — >• E such that u = eos, and hence for all j, 
c j = pj o u = pj o e o s. That is to say, [C, Cj ] factors uniquely through [E,pj o e] 
via s. Therefore [E,pj o e] is indeed a limit cone. □ 

So that gives us the desired purely categorial proof, without mentioning sub- 
sets, of our result that Set has all finite limits. But now note that we didn’t depend 
on anything but the fact that Set has finite products and equalizers. And since 
having a terminal object plus binary products sufficies for all finite products, we 
- at last! - have the originally promised sweeping result: 

Theorem 47. If < £ has a terminal object, and has all binary products and 
equalizers, it is finitely complete. 

Given ingredients from our previous discussions, since the categories in ques- 
tion have terminal objects, binary products and equalizers, 

Theorem 53. The categories of algebraic structured sets such as Mon. Grp, Ab, 
Rng are all finitely complete. Similarly Top is finitely complete. 

While e.g. a poset-as-a-category may lack many products and hence not be 
finitely complete. 


10.4 Infinite limits 

Now we extend our key Theorem 47 to reach beyond the finite case. First, we 
need: 

Definition 55. The category ^ has all (small) limits if for any diagram D whose 
objects are Dj for indices j £ /, for some set /, then has a limit over D. A 
category with all (small) limits is also said to be complete. < 

We are no longer stipulating, then, that the collection of indices for an object 
in a diagram be finite. We only require now that it is no bigger than set-sized. 
So, to repeat a remark we’ve made before, ‘small’ doesn’t mean small by any 
normal standards! - it just indicates that the relevant diagrams are not so mind- 
bogglingly enormous that they can’t be indexed by a set. 

Inspection of the proof in the last section shows that, so long as the diagram D 
is small (set-sized) , then the argument will continue to go through if we are still 
dealing with a category like Set which has products for all set-sized collections 
of objects, or as we say, has all small products. Hence, without further ado, we 
get: 

Theorem 54. If has all (small) products and has equalizers, then it has all 
small limits, i.e. is complete. 

We can similarly extend Theorem 53 to show that 
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10.5 Dualizing again 


Theorem 55. The categories of structured sets Mon, Grp. Ab. Rng (among oth- 
ers) are all complete. Top too is complete. 

We have already met a category which, by contrast, is finitely complete but is 
evidently not complete, namely FinSet. 


10.5 Dualizing again 

Needless to say by this stage, our results in this chapter dualize in obvious ways. 
Thus we need not delay over the further explanations and proofs of 

Theorem 56. If if has initial objects, binary coproducts and co-equalizers, then 
it has all finite colimits, i.e. is finitely cocomplete. If if has all coproducts and 
has co-equalizers, then it has all (small) colimits, i.e. is cocomplete. 

Theorem 57. Set is cocomplete - as are the categories of structured sets Mon, 
Grp. Ab, Rng. Top too is cocomplete. 

But note that a category can of course be (finitely) complete without being 
(finitely) cocomplete and vice versa. For a generic source of examples, take again 
a poset (P, =4) considered as a category. This automatically has all equalizers 
(and coequalizers) - see §8.1 Ex. (5). But it will have other limits (colimits) 
depending on which products (coproducts) exists, i.e. which sets of elements 
have suprema (infima). For a simple case, take a poset with a maximum element 
and such that every pair of elements has a supremum: then considered as a 
category it has all finite limits (but maybe not infinite ones). But it need not 
have a minimal element and/or infima for all pairs of objects: hence it can lack 
some finite colimits despite having all finite limits. 
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